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ALIGNMENTS 



RESULT 1 
AAB73204 

ID AAB73204 standard; Protein; 115 AA. 
XX 

AC AAB73204; 
XX 

DT ll-MAY-2001 (first entry) 
XX 

DE TGF-betal C-terminal sequence. 
XX 

KW Gene therapy; growth differentiation factor-8; GDF-8; AIDS; cachexia; 

KW neurodegenerative disease; amyotrophic lateral sclerosis; obesity; 

KW muscular dystrophy; musculodegenerat ive disease; tissue repair; 

KW muscle wasting disease; neuromuscular disorder; spinal cord injury; 

KW traumatic injury; congestive obstructive pulmonary disease. 

XX 

OS Unidentified. 
XX 

PN WO200112777-A2 . 
XX 

PD 22-FEB-2001. 
XX 

PF 17-AUG-2000; 2 000WO-US22884 . 
XX 

PR 19-AUG-1999; 99US- 0378238 . 
XX 

PA (UYJO ) UNIV JOHNS HOPKINS SCHOOL MEDICINE. 
XX 

PI Lee S, McPherron AC; 
XX 

DR WPI; 2001-211209/21. 
XX 

PT New substantially purified growth differentiation factor-8 polypeptide, 

PT useful for treating muscle wasting disease, obesity, muscular 

PT dystrophy, neuromuscular disorder, acquired immunodeficiency syndrome 

PT and cachexia 

XX 

PS Example 2; Fig 3; 124pp; English. 
XX 

CC The present invention relates to growth differentiation factor-8 (GDF-8) 

CC coding sequences and proteins. The present sequence is a protein, which 

CC was used in a sequence homology comparison with the GDF-8 protein 

CC isolated in the present invention. GDF-8 is useful for treating 

CC neurodegenerative diseases (e.g. amyotrophic lateral sclerosis and 

CC muscular dystrophy) , musculodegenerative diseases or in tissue repair due 

CC to trauma, obesity and disorders related to abnormal proliferation of 

CC adipocytes. GDF-8 is also useful for treating malignancies of the various 

CC organ systems, particularly cells in muscle or adipose tissues and in 

CC gene therapy for the treatment of cell proliferative or immunological 

CC diseases mediated by GDF-8. In addition, GDF-8 is also useful for 

CC treating muscle wasting disease, neuromuscular disorder, spinal cord 

CC injury, traumatic injury, congestive obstructive pulmonary disease 

CC (COPD) , AIDS or cachexia. 

XX 

SQ Sequence 115 AA; 



Query Match 



93.4%; Score 640; DB 22; Length 115; 



Best Local Similarity 99.1%; Pred. No. 9.5e-63; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

M I I I I II II I III I M I II Ml II II III II III I M I I I I I I I M MM II I MM 

Db 3 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 62 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 1 1 : 1 1 MM I h I II 1 1 1 1 1 1 III M I III I M M M II 1 1 1 i I II 

Db 63 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 115 

RESULT 2 
ABG76033 

ID ABG76033 standard; Protein; 115 AA. 
XX 

AC ABG76033; 
XX 

DT 30-APR-2003 (first entry) 
XX 

DE Human TGF-beta 1 C-terminus. 
XX 

KW GDF-5; growth differentiation factor 5; TGF-beta; human; 

KW transforming growth factor beta; skeletal development; endometriosis; 

KW cartilage differentiation; cell proliferative disease; uterine tumour; 

KW bone dysplasia; spondylopithyseal dysplasia; achondroplasia; 

KW dysplasia epiphysial is ; metaphyseal dysostosis; hyperchondroplsia ; 

KW enchondromatosis ; hypophosphatasia ; osteopetrosis ; hyperphosphatasia ; 

KW craniometaphyseal dysplasia; osteogenesis imperfecta; transgenic; 

KW idiopathic osteoporosis; Engelman 1 s disease; TGF beta-1. 

XX 

OS Homo sapiens. 
XX 

PN US2002165361-A1. 
XX 

PD 07-NOV-2002. 
XX 

PF 12-JUN-2001; 2001US-0880708 . 
XX 

PR 31-MAY-1995; 95US - 04 5555 9 . 

PR 01-SEP-1998; 98US-0145060 . 

PR 12-JAN-1993; 93US- 00 03 144 . 

PR 12-JAN-1994; 94WO-US00657 . 
XX 

PA (LEES/) LEE S. 

PA (HUYN/) HUYNH T. 
XX 

PI Lee S, Huynh T; 
XX 

DR WPI; 2003-246676/25. 
XX 

PT New antibody specifically binding to a GDF-5 polypeptide, useful for 

PT diagnosing and treating cell proliferative disorders with aberrant 

PT GDF-5 activity, such as endometriosis, uterine tumors and those 

PT involving skeletal tissues 
XX 

PS Example 2; Fig 3A; 36pp ; English. 



XX 

CC The invention relates to an antibody that specifically binds to growth 

CC differentiation factor-5 (GDF-5, a member of the TGF-Beta (transforming 

CC growth factor beta) super family of proteins) polypeptide appearing 

CC as ABG76018. In order to determine the biological activity of GDF-5 in 

CC vivo, transgenic mice were constructed that express GDF-5 ectopically. 

CC Analysis of two independent transgenic mouse lines showed that these 

CC animals have ectopic bone formation with evident muscle tissue. This 

CC showed that GDF-5 was capable of inducing bone formation in vivo. 

CC The antibody is useful for the diagnosis and treatment of cell 

CC proliferative disorders associated with aberrant GDF-5 activity, such 

CC as endometriosis, uterine tumors, those involving skeletal tissues, 

CC endometriosis, cartilage differentiation, cell proliferative disease, 

CC uterine tumour, bone dysplasia, spondylopithyseal dysplasia, 

CC achondroplasia, dysplasia epiphysial is , metaphyseal dysostosis, 

CC hyperchondroplsia , enchondromatosis , hypophosphatasia , osteopetrosis, 

CC hyperphosphatasia, craniometaphyseal dysplasia, osteogenesis imperfecta, 

CC idiopathic osteoporosis and Engelman's disease. The present sequence 

CC represents a member of the TGFbeta superfamily used to determine 

CC regions of sequence similarity for design of degenerate primers for 

CC isolation of GDF-5. 

XX 

SQ Sequence 115 AA; 

Query Match 93.4%; Score 64 0; DB 24; Length 115; 

Best Local Similarity 99.1%; Pred. No. 9.5e-63; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

U 1 1 IT ! M i II II 'I I II : I : II : 'II I IM, lllllllllll M MINI I II 

Db 3 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 62 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

IMilllMIII MM III II, I. MIMI'll Mil, lllllllllli II 1 1 

Db 63 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 115 



RESULT 3 
AAP61468 

ID AAP61468 standard; Protein; 390 AA. 
XX 

AC AAP61468; 
XX 

DT 31-OCT-2002 (updated) 

DT 28-OCT-1991 (first entry) 

XX 

DE PreTGF-beta gene product. 
XX 

KW Transforming growth factor beta; cancer; wound healing. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 

FT Protein 279.. 390 

XX 

PN EP200341-A. 
XX 



PD 10-DEC-1986. 
XX 

PF 21-MAR-1986; 8 6EP- 03 02 1 12 . 
XX 

PR 22-MAR-1985; 85US- 07 15 142 . 

PR 13-MAR-1987; 8 7US- 0 02 5423 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA; 
XX 

DR WPI; 1986-326875/50. 

DR N-PSDB; AAN60 972 . 
XX 

PT TGF-beta prodn. from transformed hosts - useful esp . for treating 

PT wounds (J6 2/9/86) . 

XX 

PS Disclosure; Fig lb; 26pp; English. 
XX 

CC The gene product is known to stimulate cell proliferation and 

CC inhibit anchorage -dependent growth of a variety of human cancer cell 

CC lines, it is esp. useful in treatment of burns and the promotion of 

CC surface and internal wound healing. TGF-beta may be expressed from a 

CC transformed CHO cell line. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 390 AA; 



Query Match 93.4%; Score 64 0; DB 7; Length 3 90; 

Best Local Similarity 99.1%; Pred . No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

M 1 1 1 1 1 1 1 1 1 'li 1 1. 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II . 1 1 Mi II 1 1 II 1 1 1 1 1 1 1 1 >l II 1 ,1 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

1 1 1 1 1 1 1 1 1 1 1 'll 1 1, 1 1 1 1 1 1 '1 1 1 1 1 1 1 M 1 1 1 'I I IN I M I ! 1 1 1 1 1 1 

Db 338 KVTALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 4 
AAR04034 

ID AAR04034 standard; protein; 390 AA. 
XX 

AC AAR04 034; 
XX 

DT 25-MAR-2003 (updated) 

DT 31-OCT-2002 (updated) 

DT 31-MAY-1989 (first entry) 
XX 

DE Sequence of pre-TGF-beta 1. 
XX 

KW Transforming growth factor beta-3 (TGF beta 3); tumour cells; growth 

KW inhibition. 

XX 

OS Homo sapiens . 



XX 

FH Key Location/Qualifiers 

FT CDS 348. .500 

XX 

PN WO8912101-A. 
XX 

PD 14-DEC-1989. 
XX 

PF 08-JUN-1988; 88WO-US01 945 . 
XX 

PR 08-JUN-1988; 88WO-US01945 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Dernyck RMA, Goeddel DV; 
XX 

DR WPI; 1990-007474/01. 
DR N-PSDB; AAQ02815. 
XX 

PT Nucleotide sequence encoding transforming growth factor beta-3 used as a 
PT probe, or to produce TGF beta 3, for inhibiting growth of certain normal 
PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. 2; 61pp; English. 
XX 

CC Sequence is an exon of transforming growth factor-beta 1 (pre-TGF-beta 
CC 1) polypeptide and corresponds to AA's 288-338 of mature TGF-beta 1. The 
CC nucleic acid encoding second subtype of TGF-beta (TGF-beta 3) is useful 
CC as a probe or to produce TGF-beta 3 for inhibition of normal and 
CC neoplastic cell growth. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 
CC (Updated on 25-MAR-2003 to correct PR field.) 
CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 3 90 AA; 

Query Match 93.4%; Score 640; DB 11; Length 390; 

Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

= 1 1 1 1 1 1 1 1 ! 1 1 ; 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 . i r 1 1 i 1 1 1 1 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

1 1 II I II 1 1 1 1 1 II 1 1 1 1 1 1 1 II II I II 1 1 II i 1 1: 1 1 1 1 1 1 1 1 II i 1 1 1 1 1 

Db 33 8 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 5 
AAR05258 

ID AAR05258 standard; protein; 390 AA. 
XX 

AC AAR05258; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-AUG-1990 (first entry) 



DE Human pre- transforming growth factor-beta - 1 (pre-TGF-beta-1 ) . 
XX 

KW Transforming growth factor-beta- 1 (TGF-beta- 1) ; 

KW neoplastic cell line inhibition; 

KW EGF -potentiated anchorage- independent growth. 
XX 

OS Homo sapiens. 
XX 

FH Key 

FT Peptide 

FT Protein 

FT Domain 
FT 

FT Modif ied-site 
FT 

FT Modif ied-site 
FT 

FT Modif ied-site 
FT 

FT Cleavage-site 
FT 
XX 

PN US4886747-A. 
XX 

PD 12-DEC-1989. 
XX 

PF 13-MAR-1987; 8 7US - 002 5423 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 

PR 22-MAR-1985; 8 5US - 07 1514 2 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR N-PSDB; AAQ93301. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC Pre-TGF-beta-1 is a polypeptide of 390 amino acids. Post-transitional 

CC cleavage of the precursor gives rise to the mature TGF-beta monomer. 

CC The sequence for human TGF-beta was determined by direct amino acid 

CC sequence analysis and by deduction from the TGF-beta cDNA . It is 

CC capable of inducing EGF-potent iated anchorage -independent growth of 

CC target cell lines, and/or growth inhibition of neoplastic cell lines. 

CC can be used for treating wounds, eg burns or epidermal ulcers. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 93.4%; Score 64 0; DB 11; Length 3 90; 



Locat ion/Qua 1 if iers 
1. .278 
279. .2011 
8 . .23 

/note= M hydrophobic domain" 
82 . . 84 

/note= "potential N-glycosylat ion site " 
136. .138 

/note="as above" 
176 . . 178 

/note="as above" 
277. .278 

/note="proteolytic cleavage site" 



Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I ! I I I I II I M I I I M ! I I I I I , I II I II I I I I I I I I I ! I I I I ; I I M'l I I I I I I || 
Db 278 RALDTNYCFSSTEKNCCWQLY I DFRKDLGWKWI HE PKGYHANFCLGPCPY I WSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

I I I I I II 
Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 6 




AAR13813 




ID 


AAR13813 standard; Protein; 390 AA. 


XX 






AC 


AAR13813 ; 




XX 






DT 


20-NOV-1991 


(first entry) 


XX 






DE 


Human pro-TGF 


-beta 1 . 


XX 






KW 


Osteogenetic; 


tumoricidal . 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qualif iers 


FT 


Peptide 


1. .29 


FT 




/note= "signal peptide" 


FT 


Peptide 


30.. 390 


FT 




/note= "pro-TGF-beta 1" 


FT 


Peptide 


279. .390 


FT 




/note= "TGF-beta 1" 


XX 






PN 


JP03180192-A. 




XX 






PD 


06-AUG-1991 . 




XX 






PF 


07-DEC-1989; 


89JP-0318243 . 


XX 






PR 


07-DEC-1989; 


89JP-0318243 . 


XX 






PA 


(KIRI ) KIRIN 


BREWERY KK. 


XX 






DR 


WPI; 1991-271579/37. 


DR 


N-PSDB; AAQ13392. 


XX 






PT 


Human pro-TGF 


-beta 1 prodn., for osteo-genetic activity - 


PT 


preparing DNA 


chain contg . base sequence coding for human 


PT 


pre :pro-TGF-beta 1, forming expression vector etc. 


XX 






PS 


Claim 1; Fig : 


1; 16pp; Japanese. 


XX 






cc 


The amino acid sequence codes for human prepro-TGF-beta 1 


cc 


can be produced by recombinant methods, it has osteogenet: 


cc 


tumoricidal activity . 


XX 







SQ Sequence 3 90 AA; 



Query Match 93.4%; Score 640; DB 12; Length 390; 

Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCWQLYIDFRKJDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

H 1 1 1 h 1 1 1 1 1 1 M I II I II 1 1 II I M ! II 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 . 1 1 1 1 1 1 II 1 1 1 

Db 2 78 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

I I , I I : I , I I I I I, < 'I I I i I ' I I I I II I I i I I I I I II II I II IN I I II 
Db 338 KVLALYNQHN PGAS AAPCCVPQALE PL P I VY Y VGRKPKVEQLSNM I VRS CKCS 390 



RESULT 7 


AAR20124 


ID 


AAR20124 standard; Protein; 390 AA. 


XX 




AC 


AAR2 0124 ; 


XX 




DT 


2 5 -MAR- 2 0 03 (updated) 


DT 


16-APR-1992 (first entry) 


XX 




DE 


Sequence of simian transforming growth factor (TGF) beta-1. 


XX 




KW 


Hypertension therapy; hypotensive agent; blood pressure modulator 


XX 




OS 


Monkey . 


XX 




FH 


Key Location/Qualifiers 


FT 


Peptide 8 . . 21 


FT 


Protein 279 . .390 


XX 




PN 


W09119513-A. 


XX 




PD 


26-DEC-1991. 


XX 




PF 


2 0-JUN-1991; 91WO-US04449. 


XX 




PR 


2 0-JUN-199 0; 90US- 054 1221 . 


XX 




PA 


(BRIM ) BRISTOL-MYERS SQUIBB CO. 


XX 




PI 


Oleson FB, Comereski CR; 


XX 




DR 


WPI; 1992-024199/03. 


DR 


N-PSDB; AAQ20289. 


XX 




PT 


Use of transforming growth factor (TGF) -beta and their 


PT 


antagonists - for modulating blood pressure, for treating 


PT 


hypertension and hypotension 


XX 




PS 


Disclosure; Fig 1; 42pp ; English. 


XX 




cc 


A new method for treating hypertension comprises administering a 


cc 


transforming growth factor (TGF) -beta to an individual at a dose 



CC effective for lowering blood pressure; the TGF-beta may be e.g. 

CC mature TGF-beta, TGF-beta2 / a mature TGF-betal/beta2 hybrid, TGF- 

CC betal precursor, a latent TGF-beta2 precursor, hybrid TGF-betal/TGF- 

CC beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 

CC complex. 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 390 AA; 

Query Match 93.4%; Score 640; DB 13; Length 390; 

Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

1 : 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 : 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 mi I ii 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

I MM IIIIMIIIMI IIIMII III Mill 1:111 III II 1 1 MM I III 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 

RESULT 8 
AAR46227 

ID AAR46227 standard; Protein; 390 AA. 
XX 

AC AAR46227; 
XX 

DT 25-MAR-2003 (updated) 

DT 09-JUL-1994 (first entry) 

XX 

DE Human pre-TGF-beta-1 . 
XX 

KW TGF-beta- 1; TGF-beta-2; transforming growth factor beta-1; 

KW transforming growth factor beta-3; recombinant; wound healing; 

KW vulnerary. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT Peptide 279.. 390 

FT /label= Mat_peptide 

FT Cleavage-site 279 

FT /note= "TGF-beta-l release site" 

FT Modif ied-site 82.. 84 

FT /label= N-glycosylat ion_site 

FT Modif ied-site 136.. 138 

FT /label= N-glycosylat ion_site 

FT Modif ied-site 176.. 178 

FT /label= N-glycosylation_site 

XX 

PN US5284763-A. 
XX 

PD 08-FEB-1994. 
XX 

PF 04-MAR-1992; 92US- 0845893 . 
XX 



PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



22-MAR-1985 
13-MAR-1987 
04-AUG-1989 
04-MAR-1992 



85US-0715142 . 
87US-0025423 . 
89US-0389929 . 
92US-0845893 . 



(GETH ) GENENTECH INC. 

Derynk RMA, Goeddel DV; 

WPI ; 1994-056343/07 . 
N-PSDB; AAQ56923. 

Nucleic acid sequences encoding transforming growth factor-beta - 
diagnostic probes, and for use in therapeutics 

Disclosure; Fig lb; 25pp; English. 

cDNA sequences were determined for human pre-TGF-beta-1 (AAQ56923) , 
pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 
corresponding amino acid sequences were determined (AAR46227-29 , 
respectively) . A genomic fragment corresponding to a human TGF- 
beta-1 exon (AAQ56924) was also isolated and its amino acid sequence 
determined (AAR4 623 0) . The sequences have been used in the 
construction of vectors for the expression of recombinant TGF- 
beta . 

(Updated on 25-MAR-2003 to correct PF field.) 



Sequence 3 90 AA; 

Query Match 93 .4%; 

Best Local Similarity 99.1%; 
Matches 112; Conservative 



Score 64 0; DB 15; 
Pred. No. 3.8e-62; 
1; Mismatches 0; 



Length 390; 



Indels 



0 ; Gaps 



0; 



Qy 
Db 



8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II I I I I I I I i I ! 1 1 1 1 Tl 1 1 1 II 1 1 ! 1 1 1 1 II ! 1 1 1 

278 RALDTNYCFSSTEKNCCVRQLYIDFRPCDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 



Qy 

Db 



68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

II 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 i I II 1 1 1 1 '1 1 1 1 II 1 1 1 1 1 1 

338 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 9 
AAR73596 

ID AAR73596 standard; Protein; 390 AA. 
XX 

AC AAR73596; 
XX 

DT 25-MAR-2003 (updated) 

DT 20-DEC-1995 (first entry) 

XX 

DE Human TGF-beta 1 protein. 
XX 

KW Transforming growth factor-beta; Human TGF-beta protein; TGF-beta 1; 

KW TGF-beta 3; osteogenic cell source; OCS; bone deficiency; 

KW bone- inducing cof actor. 
XX 



OS Homo sapiens. 
XX 

PN US5409896-A. 
XX 



PD 


25 


-APR- 


1995 . 






XX 












PF 


12 


-NOV- 


1993; 


93US- 


0132405 


XX 












PR 


01 


-SEP- 


1989; 


89US- 


0401906 


PR 


12 


-NOV- 


1991; 


91US- 


0790856 


PR 


18 


-MAY- 


1993; 


93US- 


0063841 


PR 


12 


-NOV- 


1993; 


93US- 


0132405 



XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Ammann AJ, Rudman CG; 
XX 

DR WPI; 1995-169610/22. 
XX 

PT Compsn. for treating skeletal tissue deficiency - comprising 

PT transforming growth factor-beta and an osteogenic cell source in a 

PT carrier 

XX 

PS Claim 3; Column 15-18; 19pp; English. 
XX 

CC This sequence represents human transforming growth factor-beta 1 

CC (TGF-beta 1) . This sequence and the sequence for human TGF-beta 3 (see 

CC AAR73598) are claimed within the scope of the invention. The invention 

CC is a composition consisting of a TGF-beta protein and an osteogenic cell 

CC source (OCS) formulated in an acceptable carrier other than a bone 

CC morphogenic cof actor. This composition can be used for the restoration 

CC of bone deficiency. This provides for the generation of mature bone 

CC only where it is required, without the inclusion of a specific 

CC bone- inducing cof actor. This method can be used with any of the 5 human 

CC TGF-beta' s or with TGF-beta from other species. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 3 90 AA; 



Query Match 93.4%; Score 64 0; DB 16; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I I III I I I I I I I I I I I I I I I I I I I III I II I I i I I I I I I I I I I I I I : I II 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 1 1 1 1 1 1 1 1 1 '1 1 : 1 1 M 1 1 1 1 ' II ; 1 1 ! 1 1 ! I II 1 1 II 1 1 MM M I 1 1 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 10 
AAR90827 

ID AAR90827 standard; Protein; 390 AA . 
XX 

AC AAR90827; 



XX 

DT 25-MAR-2003 (updated) 

DT 25-JAN-1980 (first entry) 

XX 

DE Pre-transf orming growth factor beta 1. 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT Domain 8 . . 23 

FT /note= "hydrophobic domain" 

FT Modif ied-site 82.. 84 

FT /note= "potential N-glycosylation site" 

FT Modif ied-site 136. .138 

FT /note= "potential N-glycosylation site" 

FT Modif ied-site 176.. 178 

FT /note= "potential N-glycosylation site" 

FT Cleavage-site 277.. 279 

FT /note= " trypsin-like peptidase cleavage site" 

FT Protein 279 . .390 

FT / label = matureJTGF_beta__l 
XX 

PN US5482851-A. 
XX 

PD 09-JAN-1996. 
XX 

PF 05-NOV-1993; 93US - 0 14 73 64 . 
XX 

PR 13-MAR-1987; 87US-0025423 . 

PR 22-MAR-1985; 85US - 07 1 514 2 . 

PR 04-AUG-1989; 89US-0389929 . 

PR 04-MAR-1992; 92US-0845893 . 

PR 05-NOV-1993; 93US - 0 14 7364 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1996-076891/08. 

DR N-PSDB; AAT15720. 
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 

PT using Chinese hamster ovary cells , for use in diagnostic applications 

PT or in therapy 
XX 

PS Example 3; Fig 1A-C; 26pp; English. 
XX 

CC The pre-transforming growth factor (TGF) beta 1 protein is encoded 

CC by AAT15720. The mature TGF beta 1 monomer is cleaved from the 

CC precursor at the Arg-Arg dipeptide immediately preceding the mature 

CC TGF-beta 1 NH2 -terminus . It does not contain a recognisable N- terminal 

CC signal peptide typical of most secreted proteins. The pre -TGF beta 1 

CC contains several pairs of basic residues which could undergo 

CC post-translational cleavage and give rise to separate polypeptide 



cc 
cc 
cc 
cc 
cc 

XX 
SQ 



entities. The precursor contains 3 potential N-glycosylation sites, none 
of which are localised in the mature TGF beta 1. This is useful in 
purification of the mature protein. TGF beta 1 can be used in, e.g. wound 
healing. 

(Updated on 25-MAR-2003 to correct PF field.) 



Sequence 3 90 AA; 

Query Match 93 . 4%; 

Best Local Similarity 99.1%; 
Matches 112; Conservative 



Score 64 0; DB 17; 
Pred. No. 3.8e-62; 
1; Mismatches 0; 



Length 390; 



Indels 



0; Gaps 



0; 



Qy 

Db 

Qy 
Db 



8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 67 

M I lllllll IMIMIM Ml Ml Ml II II I II Ml I I MM I M M MM Ml II 

2 78 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 337 
68 KVLALYNQHN PGASAAP CC VPQALE PL P I VYYVGRKPKVEQLSNM I VRS CKCS 120 

MMMMMMMMMMMMMMMMMMMMMMMMMMI 

338 K VTiALYNQHN PGASAAP CCVPQALE PL PI VYYVGRKPKVEQLSNM I VRS CKCS 3 90 



RESULT 11 




AAW78785 




ID 


AAW78785 standard; Protein; 390 AA. 


XX 






AC 


AAW7878 5; 




XX 






DT 


25-MAR-2003 


( updated) 


DT 


21-DEC-1998 


(first entry) 


XX 






DE 


Human pre- transforming growth factor-beta 1. 


XX 






KW 


Transforming growth factor-beta 1; TGF-beta 1 


XX 






OS 


Homo sapiens . 




XX 






FH 


Key 


Location/Qualifiers 


FT 


Domain 


8 . .23 


FT 




/note= "hydrophobic domain" 


FT 


Protein 


279. .390 


FT 




/label= Mat protein 


FT 


Modif ied-site 


82.. 84 


FT 




/note= "Asn is N-glycosylated 


FT 


Modif ied-site 


136. .138 


FT 




/note= "Asn is N-glycosylated 


FT 


Modif ied-site 


176 . . 178 


FT 




/note= "Asn is N-glycosylated 


FT 


Cleavage-site 


277 . .278 


FT 




/note= "cleavage site for rel< 


XX 






PN 


US5801231-A. 




XX 






PD 


01-SEP-1998 . 




XX 






PF 


30-MAY-1995; 


95US-0454468 . 


XX 






PR 


13-MAR-1987; 


87US-0025423 . 



PR 22-MAR-1985; 85US- 0715142 . 

PR 04-AUG-1989; 8 9US- 038 992 9 . 

PR 04-MAR-1992; 92US- 084 58 93 . 

PR 05-NOV-1993; 93US-0 147364 . 

PR 30-MAY-1995; 95US- 04 54468 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR N-PSDB; AAV52933. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 26pp; English. 
XX 

CC This is the amino acid sequence of human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1) . It was deduced from 

CC a preTGF-beta 1 cDNA sequence (see AAV52933) . The invention relates 

CC to the recombinant production of TGF-beta. Biologically active 

CC TGF-beta is defined as being capable of inducing EGF-potent iated 

CC anchorage independent growth of target cell lines and/or growth 

CC inhibition of neoplastic cell lines. Nucleic acids encoding 

CC TGF-beta have been isolated and cloned into vectors which are 

CC replicated in bacteria and expressed in eukaryotic cells. TGF-beta 

CC recovered from transformed cells is used in known therapeutic 

CC applications. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 3 90 AA; 

Query Match 93.4%; Score 640; DB 19; Length 390; 
Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 12 0 

I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 338 KVTLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 

RESULT 12 
AAM39186 

ID AAM39186 standard; Protein; 390 AA. 
XX 

AC AAM39186; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polypeptide SEQ ID NO 2331. 
XX 



KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 

KW leukaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2 0 00WO-US34263 . 
XX 

PR 21-JAN-2000; 2 000US- 0488725 . 

PR 25-APR-2000; 2 000US- 05523 17 . 

PR 09-JUL-2000; 2 0 00US - 0598 042 . 

PR 19-JUL-2000; 2 0 00US- 062 03 12 . 

PR 03-AUG-2000; 2 0 00US- 065345 0 . 

PR 14-SEP-2000; 2 0 00US- 0662 19 1 . 

PR 19-OCT-2000; 2 0 00US- 0693 03 6 . 

PR 29-NOV-2000; 2000US- 0727344 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Chen R , Ma Y, Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C ( Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR N-PSDB; AAI58342. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Example 4; SEQ ID NO 2331; 10078pp; English. 
XX 

CC The invention relates to human nucleic acids (AAI 57798 -AAI61369 ) and 

CC the encoded polypeptides (AAM38642 -AAM42213 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Activin/inhibin activity, chemotact ic/chemokinet ic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 3 90 AA; 



Query Match 93.4%; Score 640; DB 22; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 



8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 



Db 




Qy 



68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 



Db 




RESULT 13 
AAU77101 

ID AAU77101 standard; Protein; 390 AA. 
XX 

AC AAU77101; 
XX 

DT 05-JUN-2002 (first entry) 
XX 

DE Human transforming growth factor beta 1 (TFG-beta-1) polypeptide. 
XX 

KW Human; transforming growth factor beta; TGF-beta; insulin production; 

KW type I diabetes mellitus; pancreatic cell outgrowth; wound healing; 

KW pancreatic duct tissue; ischaemia; stroke; nervous system aging; 

KW neurological condition; neurodegenerative disease; inflammation; 

KW vasal injury; chemical injury; traumatic injury; tumour- induced injury; 

KW amyotrophic lateral sclerosis; spinocerebellar degeneration; 

KW immunological disease; multiple sclerosis; TGF-beta-1. 

XX 

OS Homo sapiens. 
XX 

PN WO200212336-A2. 
XX 

PD 14-FEB-2002. 
XX 

PF 09-FEB-2001; 2001WO-US04192 . 
XX 

PR 09-AUG-2000; 2 0 0 0US - 063 53 68 . 
XX 

PA ( CUR I - ) CURIS INC. 
XX 

PI Wang M, Pang K; 
XX 

DR WPI; 2002-257468/30. 
XX 

PT Treating a subject with a disorder resulting from insufficient insulin 

PT production, and inducing outgrowth of pancreatic cells, involves using 

PT a transforming growth factor beta therapeutic 
XX 

PS Disclosure; Fig 1; 77pp ; English. 
XX 

CC The invention relates to treating a subject with a disorder resulting 

CC from insufficient insulin production, involving contacting the subject 

CC with a transforming growth factor beta (TGF-beta) therapeutic. TGF-beta 

CC polypeptides can be used for treating a subject with a disorder resulting 



CC from insufficient insulin production, e.g. type I diabetes mellitus, and 

CC for inducing outgrowth of pancreatic cells associated with pancreatic 

CC duct tissue within a subject. A composition comprising a TGF-beta protein 

CC may be useful in wound healing and treatment of neurological conditions 

CC derived from acute, subacute or chronic injury to the nervous system, 

CC including traumatic injury, chemical injury, vasal injury and deficits 

CC (such as ischaemia resulting from stroke) , together with 

CC infectious/inflammatory and tumour- induced injury, aging of the nervous 

CC system including Alzheimer's disease, chronic neurodegenerative diseases 

CC including Parkinson's disease, Huntington's chorea, amyotrophic lateral 

CC sclerosis, spinocerebellar degenerations and chronic immunological 

CC diseases of the nervous system or affecting the nervous system, including 

CC multiple sclerosis. This sequence represents the human TGF-beta-1 

CC protein. 

XX 

SQ Sequence 3 90 AA; 

Query Match 93.4%; Score 64 0; DB 23; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLY I DFRKDLGWKW I HEPKGYHANFCLGPCP YI WSLDTQYS 67 

HI II I II I I I I I I I i II I II I II. I I I I I I I I I I II Ml I II II II ' I I I I I M M I I 
Db 278 RALDTNYCFSSTEKNCCVRQLY I DFRKDLGWKW I HEPKGYHANFCLGPCPYI WSLDTQYS 337 

Qy 68 KVLALYNQHN PGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 120 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 338 KVTjALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 

RESULT 14 
AAE13596 

ID AAE13596 standard; Protein; 390 AA. 
XX 

AC AAE13596; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) mutant. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 

KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 

KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 

KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; mutant; 

KW mutem. 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 

FT Misc-dif f erence 223 

FT /note= "Wild type Cys substituted with Ser" 

FT Misc-dif f erence 225 

FT /note= "Wild type Cys substituted with Ser" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 



XX 

PF 20-APR-2001; 2001WO-US12980. 
XX 

PR 20-APR-2000; 2000US-199014P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR N-PSDB; AAD22 696. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Example 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD), under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 

CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is porcine TGF-betal mutant. 

XX 

SQ Sequence 3 90 AA; 



Query Match 93.4%; Score 640; DB 23; Length 390; 

Best Local Similarity 99.1%; Pred. No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I L 1 1 1 1 1 1 1 ! M II ! I M 1 1 1 1 II Mhl I 1 1 II I II 1 1 1 1 1 1 1 li 1 1 1 1 1 1 1 i III 

Db 2 78 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHN PGASAAPCCVPQALE PLP I VY YVGRKPKVEQLSNM I VRS CKCS 12 0 

I Ml 1 1 1! 1 1 ' 1 1 1 1 1 1 1 ! 1 1 1 1 II 1 1 1 II I 1 1 1 1 1 1 1 II I II 1 1 II 1 1 1 1 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLPIVY YVGRKPKVEQLSNM I VRS CKCS 390 



RESULT 15 
AAE16943 

ID AAE16943 standard; Protein; 390 AA. 
XX 

AC AAE16943; 



XX 

DT 18-APR-2002 (first entry) 
XX 

DE Human transforming growth factor-betal (TGF-betal) protein. 
XX 

KW Human; transforming growth factor-betal; TGF-betal; osteoporosis; 

KW latency associated peptide; LAP; mtegrin alphavbeta3; apoptosis; 

KW immunomodulation; inflammatory disease; fibrotic disease; cancer; 

KW diabetic retinopathy; chronic obstructive pulmonary disorder; 

KW bone resorption; rheumatoid arthritis; psoriasis; restenosis; 

KW atherosclerosis; liver fibrosis; asthma; cytostatic; osteopathic; 

KW ophthalmological ; ant iarterioscl erotic ; vasotropic . 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Peptide 1 . .29 

FT /label= Signal_peptide 

FT Region 30 . . 278 

FT /note= "LAP-betal" 

FT Domain 244 . . 246 

FT /note= " RGD motif" 

FT Protein 279 . .390 

FT /note= "Human mature TGF-betal protein" 

XX 

PN WO200190186-A1. 
XX 

PD 29-NOV-2001. 
XX 

PF 25-MAY-2001; 2001WO-GB02352 . 
XX 

PR 26-MAY-2000; 2000GB-0012991 . 

PR 05-JAN-2001; 2001GB- 0000286 . 
XX 

PA (GLAX ) GLAXO GROUP LTD . 
XX 

PI Ludbrook S, Barry S, Horgan C, Miller D; 
XX 

DR WPI; 2002-097645/13. 
XX 

PT Identifying modulators of interactions between latency associated 

PT peptides and integrin alphavbeta3 for therapeutics, by contacting the 

PT peptide and integrin with a test product and determining if the product 

PT modulates interaction - 

XX 

PS Disclosure; Page 37-39; 44pp; English. 
XX 

CC The invention relates to a method for identification of a modulator 

CC of the interaction between latency associated peptide (LAP) of 

CC transforming growth factor-betal (TGF-betal) and integrin alphavbeta3 . 

CC The method is useful for identifying a modulator of the interaction 

CC between LAP and integrin alphavbeta3 . The method is useful for 

CC immunomodulation, in the treatment of inflammatory disease, fibrotic 

CC disease, cancer, diabetic retinopathy, bone resorption or osteoporosis, 

CC and for preventing apoptosis administering the modulator to the host. 

CC The modulator (inhibitor of the interaction between LAP-betal and 

CC integrin alphavbeta3) is useful in the manufacture of a medicament for 



CC immunomodulation . The modulator (activator of the interaction between 

CC LAP-betal and integrin alphavbeta3 ) is useful in the manufacture of 

CC medicament for preventing apoptosis. The modulator is useful for 

CC treating a inflammatory or fibrotic disease such as chronic obstructive 

CC pulmonary disorder, rheumatoid arthritis, psoriasis, restenosis, 

CC atherosclerosis, liver fibrosis and asthma. The present sequence is 

CC human TGF-betal protein. 

XX 

SQ Sequence 3 90 AA; 

Query Match 93.4%; Score 640; DB 23; Length 390; 

Best Local Similarity 99.1%; Pred . No. 3.8e-62; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I M II II I I I I I I I I I I I I I I 
Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
Db 338 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



Search completed: October 28, 2003, 09:06:46 
Job time : 11.0108 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched : 



October 28, 2003, 09:09:54 ; Search time 7.28061 Seconds 

(without al ignments ) 
2760.110 Million cell updates/sec 

US-10-017-372E-11 
685 

1 DYKDDDDKALDTNYCFSSTE GRKPKVEQLSNMI VRSCKCS 12 0 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

629382 seqs, 167460630 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



629382 



Database 



Published_Applications__AA: * 
1 : /cgn2__6/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 
2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 



3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW__PUB.pep: * 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 

5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW__PUB.pep: * 

6 : /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep: * 

7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep: * 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

9 : /cgn2_6/ptodata/2/pubpaa/US09A_PUBC0MB.pep: * 
10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: * 
11 : /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12 : /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep: * 
13: / cgn2_6 /p t oda ta / 2 /pubpaa /US 1 0A_PUBCOMB . pep : * 
14 : /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep: * 
15 : /cgn2_6/ptodata/2/pubpaa/US10C_PUBCOMB.pep: * 
16 : /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep: * 
17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep: * 
18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


640 


93 


. 4 


114 


10 


us- 


09 


-813 


-459-22 


Sequence 


22, Appl 
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640 


93 


.4 


114 


14 


us- 


10 


-115 


-406-21 


Sequence 


21, Appl 
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640 


93 


.4 


114 


15 


us- 


10 


-154 


-333-23 


Sequence 


23, Appl 
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640 


93 


.4 


115 


10 


us- 


09 


-859 


-211-47 


Sequence 


47, Appl 
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640 


93 


.4 


115 


10 


us- 


09 


-880 


-708-25 


Sequence 


25, Appl 


6 


640 


93 


.4 


115 


11 


us- 


09 


-872 


-856-47 


Sequence 


47, Appl 


7 


640 


93 


.4 


115 


15 


us- 


10 


-335 


-483-29 


Sequence 


29, Appl 


8 


640 


93 


.4 


139 


14 


us- 


10 


-002 


-278-8 


Sequence 


8, Appli 


9 


640 


93 


.4 


315 


11 


us- 


09 


-214 


-592-25 


Sequence 


25, Appl 


10 


640 


93 


4 


390 


10 


us- 


09 


-756 


-283A-23 


Sequence 


23, Appl 


11 


640 


93 
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390 


11 


us- 


09 


-214 


-592-26 


Sequence 


26, Appl 


12 


640 


93 


4 


390 


11 


us- 


09 


-214 


-592-28 


Sequence 


28, Appl 


13 


640 


93 
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390 


11 


us- 


09 


-214 


-592-29 


Sequence 


29, Appl 


14 


640 


93 


4 


390 


11 


us- 


09 


-214 


-592-33 


Sequence 


33, Appl 


15 


640 


93 


4 


390 


12 


us- 


10 


-276 


-947-1 


Sequence 


1, Appli 


16 


640 


93 


4 


390 


15 


us- 


10 


-087 


-268-2 


Sequence 


2, Appli 


17 


640 


93 


4 


390 


15 


us- 


10 


-087 


-268-5 


Sequence 


5, Appli 


18 


640 


93 


4 


391 


11 


us- 


09 


-214 


-592-17 


Sequence 


17, Appl 


19 


638 


93 


1 


112 


10 


us- 


09 


-813 


-271B-2 


Sequence 


2, Appli 


20 


638 


93 


1 


113 


10 


us- 


09 


-813 


-398-13 


Sequence 


13, Appl 


21 


637 


93 


0 


390 


11 


us- 


09 


-214 


-592-20 


Sequence 


20, Appl 


22 


637 


93 


0 


390 


11 


us- 


09 


-214 


-592-23 


Sequence 


23, Appl 


23 


562 


82 


0 


98 


12 


us- 


10 


-187 


-394-1 


Sequence 


1, Appli 


24 


542 


79 


1 


116 


14 


us- 


10 


-115 


-406-24 


Sequence 


24, Appl 


25 


542 


79 


1 


116 


15 


us- 


10 


-154 


-333-26 


Sequence 


26, Appl 


26 


542 


79 


1 


373 


11 


us- 


09 


-214 


-592-32 


Sequence 


32, Appl 


27 


535 


78 


1 


112 


10 


us- 


09 


-813 


-271B-8 


Sequence 


8, Appli 


28 


531 


77 


5 


304 


10 


us- 


09 


-756 


-283A-26 


Sequence 


26, Appl 


29 


505 


73 


7 


114 


14 


us- 


10 


-115 


-406-25 


Sequence 


25, Appl 


30 


505 


73 


7 


114 


15 


us- 


10 


-154 


-333-27 


Sequence 


27, Appl 


31 


505 


73 


7 


382 


11 


us- 


09 


-214 


-592-34 


Sequence 


34, Appl 
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73 


. 6 


112 


10 


us- 


09 


-813 


-271B- 12 


^ pnn pn c P 


12 


AddI 


33 
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73 


. 4 
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10 


us- 


09 


-756 


-283A-27 




27, 


AddI 


34 
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73 


. 3 


114 


10 


us- 


09 


-813 


-459-24 


Spnupn.ce 


24 , 


AddI 


35 


502 


73 


. 3 


114 


14 


US- 


10 


-115 


-4 06-23 


Cpm i p>r~| C f± 


23 




36 


502 


73 


3 


114 


15 


US- 


10 


-154 


-333-25 


^pniipnpp 


25 , 




37 


502 


73 


3 


115 


10 


us- 


09 


-880 


-708-27 


Sequence 


27, 


AddI 


38 


502 


73 


3 


410 


11 


us- 


09 


-214 


-592-22 


Sequence 


22, 


Appl 


39 


502 


73 


3 


412 


11 


us- 


09 


-214 


-592-19 


Sequence 


19, 


Appl 


40 


502 


73 


3 


412 


14 


us- 


10 


-028 


-158-21 


Sequence 


21, 


Appl 


41 


501 


73 


1 


412 


11 


us- 


09 


-214 


-592-24 


Sequence 


24, 


Appl 


42 


500 


73 


0 


112 


10 


us- 


09 


-813 


-271B-6 


Sequence 


6, 


Appli 


43 


500 


73 


0 


113 


10 


us- 


09 


-813 


-398-15 


Sequence 


15, 


Appl 


44 


499 


72 


8 


98 


12 


us- 


10 


-187 


-394-4 


Sequence 


4, 


Appli 


45 


499 


72 


8 


412 


11 


us- 


09 


-214 


-592-31 


Sequence 


31, 


Appl 



ALIGNMENTS 



RESULT 1 

US-09-813-459-22 

; Sequence 22, Application US/09813459 
; Patent No. US20020107369A1 
GENERAL INFORMATION: 

APPLICANT: Lee, Se-Jm 

Cunningham, No. US2002 0107369Aleen 
TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR- 10 
NUMBER OF SEQUENCES: 2 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Spensley Horn Jubas & Lubitz 

STREET: 1880 Century Park East, Suite 500 

CITY: Los Angeles 

STATE: California 

COUNTRY: USA 

ZIP: 90067 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patent In Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 9/8 13 , 459 

FILING DATE: 20-Mar-2001 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/624,635 

FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 

NAME: Wetherell, Jr., Ph.D., John R., 

REGISTRATION NUMBER: 31,678 

REFERENCE/DOCKET NUMBER: PD-3 054 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (619) 455-5100 

TELEFAX: (619) 455-5110 
INFORMATION FOR SEQ ID NO: 22: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 114 amino acids 



TYPE: amino acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta-1 
FEATURE : 

NAME/ KEY : Protein 
LOCATION: 1..114 
SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
US-09-813-459-22 

Query Match 93.4%; Score 64 0; DB 10; Length 114; 

Best Local Similarity 99.1%; Pred. No. 8e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

M 1 1 M I II I II I Ml II 1 1 II 1 1 1 II I! M 1 1 1 M II 1 1 M I MM M I II I MM II I 

RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 61 
KVLALYNQHNPGASAAPCCVPQALE PLP I VY YVGRKPKVEQLSNM I VRS CKCS 120 

I ! I Ml 1 1 1 1 II I II 1 1 1 1 1 I II 1 1 1 1 1 II I II 1 1 1 1 1 1 II I Ml 1 1 II 1 1 1 



Qy 


8 


Db 


2 


Qy 


68 


Db 


62 



RESULT 2 

US-10-115-406-21 

; Sequence 21, Application US/10115406 
; Publication No. US20020127612A1 
; GENERAL INFORMATION: 

; APPLICANT: THE JOHNS HOPKINS UNIVERSITY SCHOOL OF MEDICINE 
; APPLICANT: LEE, Se-Jin 

; TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR- 9 
; FILE REFERENCE: JHU1190-3 

; CURRENT APPLICATION NUMBER: US/10/115,406 
; CURRENT FILING DATE: 2002-04-02 

PRIOR APPLICATION NUMBER: 09/301,520 
; PRIOR FILING DATE: 1999-04-28 

PRIOR APPLICATION NUMBER: US 09/172,062 
; PRIOR FILING DATE: 1998-10-13 
; PRIOR APPLICATION NUMBER: US 08/491,835 
; PRIOR FILING DATE: 1995-10-23 
; PRIOR APPLICATION NUMBER: PCT/US94/00685 
; PRIOR FILING DATE: 1994-01-12 
; PRIOR APPLICATION NUMBER: US 08/003,303 

PRIOR FILING DATE: 1993-01-12 
; NUMBER OF SEQ ID NOS : 28 

SOFTWARE: Pa tent In version 3.0 
; SEQ ID NO 21 
LENGTH: 114 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-115-406-21 



Query Match 93.4%; Score 64 0; DB 14; Length 114; 

Best Local Similarity 99.1%; Pred. No. 8e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 



0; 



Qy 



8 



KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 67 



Db 



2 




Qy 



68 



KVLALYNQHN PGASAAPCCVPQALEPLP I VY YVGRKPKVEQLSNMI VRSCKCS 120 



Db 



62 




RESULT 3 

US-10-154-333-23 

; Sequence 23, Application US/10154333 
; Publication No. US20030109684A1 
GENERAL INFORMATION: 

APPLICANT: JOHNS HOPKINS UNIVERSITY 

TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-3 
NUMBER OF SEQUENCES: 2 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SPENSLEY HORN JUBAS & LUBITZ 

STREET: 188 0 CENTURY PARK EAST, FIFTH FLOOR 

CITY: LOS ANGELES 

STATE: CALIFORNIA 

COUNTRY: US 

ZIP : 90067 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/154 , 333 

FILING DATE: 21-May-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/389,705 

FILING DATE: 03-Sep-1999 

APPLICATION NUMBER: 09/153,733 

FILING DATE: < Unknown > 
ATTORNEY /AGENT INFORMATION: 

NAME : WETHERELL , JR . Ph . D . , JOHN R . 

REGISTRATION NUMBER: 31,678 

REFERENCE/DOCKET NUMBER: FD22 7 9 PCT 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (619) 455-5100 

TELEFAX: (619) 455-5110 
INFORMATION FOR SEQ ID NO: 23: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 114 amino acids 

TYPE: amino acid 

STRANDEDNESS : single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta 1 
FEATURE : 

NAME/KEY: Protein 



LOCATION : 1 . . 114 
SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
US-10-154-333-23 



Query Match 93.4%; Score 64 0; DB 15; Length 114; 

Best Local Similarity 99.1%; Pred . No. 8e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I II I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 61 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 62 KVLALYNQHNPGASAAPCCVPQALEPLP I VYY VGRKPKVEQLSNMI VRS CKCS 114 



RESULT 4 

US-09-859-211-47 

; Sequence 47, Application US/09859211 
; Patent No. US20020157125A1 
; GENERAL INFORMATION: 

; APPLICANT: Lee, Se-Jin 

APPLICANT: McPherron, Alexandra C. 
; TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-8 
; FILE REFERENCE: 07265/144001 
; CURRENT APPLICATION NUMBER: US/09/859,211 
; CURRENT FILING DATE: 2001-05-15 
; PRIOR APPLICATION NUMBER: 09/019,070 
; PRIOR FILING DATE: 1998-02-05 
; PRIOR APPLICATION NUMBER: 08/862,445 
; PRIOR FILING DATE: 1997-05-23 
; PRIOR APPLICATION NUMBER: 08/847,910 
; PRIOR FILING DATE: 1997-04-28 
; PRIOR APPLICATION NUMBER: 08/795,071 

PRIOR FILING DATE: 1997-02-05 
; PRIOR APPLICATION NUMBER: 08/525,596 

PRIOR FILING DATE: 1995-10-26 

PRIOR APPLICATION NUMBER: PCT/US94 / 03 0 1 9 
; PRIOR FILING DATE: 1994-03-18 
; PRIOR APPLICATION NUMBER: 08/033,923 
; PRIOR FILING DATE: 1993-03-19 
; NUMBER OF SEQ ID NOS : 51 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 47 
LENGTH: 115 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-09-859-211-47 



Query Match 93.4%; Score 64 0; DB 10; Length 115; 

Best Local Similarity 99.1%; Pred. No. 8e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 67 

:|IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
Db 3 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 62 



Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I 
Db 63 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 115 



RESULT 5 

US-09-880-708-25 

; Sequence 25, Application US/09880708 
; Patent No. US20020165361A1 
GENERAL INFORMATION: 

APPLICANT: Lee, Se-Jm 
Huynh, Thanh 

TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR- 5 
NUMBER OF SEQUENCES: 2 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Gray Cary Ware & Freidenrich LLP 

STREET: 4365 Executive Drive, Suite 1600 

CITY: San Diego 

STATE : CA 

COUNTRY: USA 

ZIP: 92121-2189 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: Windows 95 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 9/ 88 0 , 7 08 

FILING DATE: 12-Jun-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 09/145,060 

FILING DATE: <Unknown> 

APPLICATION NUMBER: 08/003,144 

FILING DATE: 12-JAN-1993 
ATTORNEY/AGENT INFORMATION: 

NAME: Lisa A. Haile, Ph.D. 

REGISTRATION NUMBER: 38,347 

REFERENCE/DOCKET NUMBER: 07265/057002 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 858/677-1456 

TELEFAX: 619/677-1465 
INFORMATION FOR SEQ ID NO; 25: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 115 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta-1 
SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
US-09-880-708-25 

Query Match 93.4%; Score 64 0; DB 10; Length 115; 

Best Local Similarity 99.1%; Pred. No. 8e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

:|IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIII 
Db 3 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 62 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

1 1 1 1 1 II 1 1! I II 1 1 1 1 1 1 1 1 1 1 1 1 1 ; h I I ' 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 

Db 63 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 115 



RESULT 6 

US-09-872-856-47 

; Sequence 47, Application US/09872856 
; Publication No. US2003 0074 68 0A1 
; GENERAL INFORMATION: 

; APPLICANT: Johns Hopkins University School of Medicine 
; APPLICANT: Lee, Se-Jm 

APPLICANT: McPherron, Alexandra 

TITLE OF INVENTION: Growth Differentiation Factor-8 
; FILE REFERENCE: JHU1120-17 

; CURRENT APPLICATION NUMBER : US/09/872,856 

; CURRENT FILING DATE: 2001-06-01 

; PRIOR APPLICATION NUMBER: US 09/124,180 

; PRIOR FILING DATE: 1998-07-28 

; PRIOR APPLICATION NUMBER: US 09/019,070 

; PRIOR FILING DATE: 1998-02-05 

; PRIOR APPLICATION NUMBER: US 08/862,445 

; PRIOR FILING DATE: 1997-05-23 

; PRIOR APPLICATION NUMBER: US 08/847,910 

; PRIOR FILING DATE: 1997-04-28 

; PRIOR APPLICATION NUMBER: US 08/795,071 

PRIOR FILING DATE: 1997-02-05 
; PRIOR APPLICATION NUMBER: US 08/525,596 
; PRIOR FILING DATE: 1995-10-25 
; PRIOR APPLICATION NUMBER: PCT/US 94/03019 
; PRIOR FILING DATE: 1994-03-18 
; PRIOR APPLICATION NUMBER: US 08/033,923 
; PRIOR FILING DATE: 1993-03-19 
; NUMBER OF SEQ ID NOS : 53 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 47 

LENGTH: 115 

TYPE : PRT 
; ORGANISM: Homo sapiens 
US-09-872-856-47 

Query Match 93.4%; Score 640; DB 11; Length 115; 

Best Local Similarity 99.1%; Pred . No. 8e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I I I I II I I I I II I I I I I I 
Db 3 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 62 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 63 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM I VRSCKCS 115 



RESULT 7 

US-10-335-483-29 

; Sequence 29, Application US/10335483 
; Publication No. US20030120058A1 
GENERAL INFORMATION: 

APPLICANT: Huynh, Thanh 
Lee, Se-Jin 

TITLE OF INVENTION: GROWTH DIFFERENTIATION FACTOR-8 
NUMBER OF SEQUENCES: 32 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fish & Richardson P.C. 

STREET: 4225 Executive Square, Suite 1400 

CITY: La Jolla 

STATE: CA 

COUNTRY: US 

ZIP: 92037 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: Wmdows95 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 1 0/33 5 , 4 8 3 

FILING DATE: 31-Dec-2002 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 9 / 177 , 8 6 0 

FILING DATE: <Unknown> 

APPLICATION NUMBER: 08/525,596 

FILING DATE: 19-SEP-1995 

APPLICATION NUMBER: PCT/US94 / 07762 

FILING DATE: 08-JUL-1994 
ATTORNEY /AGENT INFORMATION: 

NAME: Wetherell, Jr., Ph.D, John R . 

REGISTRATION NUMBER: 31,678 

REFERENCE/DOCKET NUMBER: 07265/075001 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: 619-678-5070 

TELEFAX: 619-678-5099 
INFORMATION FOR SEQ ID NO: 29: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 115 amino acids 

TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
IMMEDIATE SOURCE: 

CLONE: TGF-beta-1 
FEATURE : 

NAME/ KEY : Protein 

LOCATION: 1 . . 115 
SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
US-10-335-483-29 

Query Match 93.4%; Score 64 0; DB 15; Length 115; 

Best Local Similarity 99.1%; Pred. No. 8e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 67 

: I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I II I I I I I I I I I I I I I I I I I I I 
Db 3 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 62 

Qy 68 KVT^LYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 

I MINI llllllllll I III MINI I MINI MINI MUM MM III 

Db 63 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 115 



RESULT 8 
US-10-002-278-8 

Sequence 8, Application US/10002278 
Publication No. US20020132334A1 
GENERAL INFORMATION: 
APPLICANT: Jessell, Thomas M. 
APPLICANT: Basler, Konard 
APPLICANT: Yamada, Toshiya 

TITLE OF INVENTION: CLONING, EXPRESSION AND USES OF DORSALIN-1 
FILE REFERENCE: 0575/40314-A 
CURRENT APPLICATION NUMBER: US/10/002,278 
CURRENT FILING DATE: 2001-11-02 
NUMBER OF SEQ ID NOS : 18 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 8 
LENGTH: 13 9 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: COOH- terminus of TGF-Beta 1 
NAME /KEY: DOMAIN 
LOCATION: (1) . . (139) 
OTHER INFORMATION: 
US-10-002-278-8 



Query Match 93 . 4%; 

Best Local Similarity 99.1%; 
Matches 112; Conservative 



Score 64 0; DB 14; 
Pred. No. 9.9e-60; 
1; Mismatches 0; 



Length 13 9; 



Indels 



0 ; Gaps 



0; 



Qy 

Db 

Qy 
Db 



8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 

MMIIMMMMMMMMMMMIMMMMIMMMMMMMMIIMM 

27 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 
68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 

Mlllll IIMIIIIIIIIIIIIIIIIIIIIIIMI lllllllll Mill 

87 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 13 9 



67 



86 



RESULT 9 

US-09-214-592-25 

; Sequence 25, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKen j i 

; APPLICANT: Sato , CYasuf umi 



; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 25 
LENGTH: 315 
TYPE : PRT 
ORGANISM: bovine 
US-09-214-592-25 

Query Match 93.4%; Score 64 0; DB 11; Length 315; 

Best Local Similarity 99.1%; Pred . No. 2.4e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I M I I I I I I I I I II I I ! , I 1' I I II ! I M I I I I I I , M I II I II I I II I I I I II i 1 1 1 ' 
Db 203 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 262 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

I I I II I I I I I I I I I I I I I M I ' I I II ! I I I I I I I I I I , I II I ll ! I II I I I I 
Db 263 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 315 



RESULT 10 
US-09-756-283A-23 

; Sequence 23, Application US/09756283A 
; Patent No. US2 0020151478A1 
; GENERAL INFORMATION: 

APPLICANT: Cherna j ovsky , Yuti 

APPLICANT: Dreja, Hanna Stina 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 
; CURRENT APPLICATION NUMBER: US/09/756 , 283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS: 100 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 23 
LENGTH: 390 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-756-283A-23 

Query Match 93.4%; Score 640; DB 10; Length 390; 

Best Local Similarity 99.1%; Pred. No. 3e-59; 

Matches 112; Conservative 1 ; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ( I I I I I I I I I I I I 
Db 278 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

II MINI III MINI I II III MM MM II I Mill III MINIUM I 



Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 390 



RESULT 11 
US-09-214-592-26 

Sequence 26, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION: 
APPLICANT: Yamasaki , CMotoo 
APPLI CANT : Sh iba ta , CKen j i 
APPLI CANT : Sato , CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER : US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 26 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: porcine 
US-09-214-592-26 

Query Match 93.4%; Score 64 0; DB 11; Length 3 90; 

Best Local Similarity 99.1%; Pred . No. 3e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0, 
Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 67 

: 1 1 1 1 1 h III I! 1 1 1 1 Ml ! 1 1 L 1 1 1 II : 1 1 Ml I : II II 1 1 1 1 1 1 1 1 ' 1 1 ! I II I II 

Db 278 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 120 

IMMIMMIII MMMIIMM IMMMMMIMI MMMMM 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 390 



RESULT 12 
US-09-214-592-28 

; Sequence 28, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKen] i 

; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS: 34 
; SOFTWARE : 
; SEQ ID NO 28 

LENGTH: 3 90 

TYPE: PRT 



ORGANISM: canine 
US-09-214-592-28 

Query Match 93.4%; Score 640/ DB 11; Length 390; 

Best Local Similarity 99.1%; Pred . No. 3e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 120 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I II I I I I I I I I I I I I I I II 
Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 13 
US-09-214-592-29 

Sequence 29, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION: 
APPLICANT: Yamasaki , CMotoo 
APPLICANT: Shibata,CKenji 
APPLICANT: Sato,CYasufumi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 29 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: ovine 
US-09-214-592-29 

Query Match 93.4%; Score 64 0; DB 11; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 3e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I II I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALE PL PI VYYVGRKPKVEQLSNM I VRSCKCS 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP! VYYVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 14 
US-09-214-592-33 

; Sequence 33, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION : 

; APPLICANT: Yamasaki , CMotoo 



APPLICANT: Shibata , CKenj i 
APPLICANT: Sato, CYasufumi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 33 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: simian 
US-09-214-592-33 

Query Match 93.4%; Score 640; DB 11; Length 390; 

Best Local Similarity 99.1%; Pred . No. 3e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0, 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 78 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 33 7 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 1 II M I II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 338 KVLALYNQHNPGASAAPCC VPQALE PLP I VYYVGR KPKVEQLSNM I VRSCKCS 3 90 



RESULT 15 
US-10-276-947-1 

; Sequence 1, Application US/10276947 

; Publication No. US20030176315A1 

; GENERAL INFORMATION: 

; APPLICANT: GLAXO GROUP LIMITED 

; TITLE OF INVENTION: MEDICAL PRODUCTS 

; FILE REFERENCE: PG3 94 9 

; CURRENT APPLICATION NUMBER: US/ 1 0/276 , 94 7 
; CURRENT FILING DATE: 2002-11-21 

PRIOR APPLICATION NUMBER: GB 0012991.6 
; PRIOR FILING DATE: 2000-05-26 
; PRIOR APPLICATION NUMBER: GB 0100286.4 
; PRIOR FILING DATE: 2001-01-05 
; NUMBER OF SEQ ID NOS: 1 
; SOFTWARE: Patentln Ver. 2.0 
; SEQ ID NO 1 

LENGTH: 3 90 

TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-276-947-1 

Query Match 93.4%; Score 640; DB 12; Length 390; 

Best Local Similarity 99.1%; Pred. No. 3e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 67 

: I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 



Qy 

Db 



68 KVLiALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 i! 1 1 1 1 1 1 . 1 1 II II' 1 1 ! 1 1 1 ll 1 1 1 1 1 . 1 1 Ml I ' 

338 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



Search completed: October 28, 2003, 09:27:59 
Job time : 7.28061 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 

Run on: October 28, 2003, 00:37:16 ; Search time 3.96533 Seconds 

(without al ignment s ) 
2910.285 Million cell updates/sec 

Title: US-10-017-372E-11 
Perfect score: 685 

Sequence: 1 DYKDDDDKALDTNYCFSSTE GRKPKVEQLSNM I VRSCKCS 120 

Scoring table: 

Searched: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 
283308 seqs, 96168682 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283308 



Database 



PIR_76 : * 
pirl : * 

pir2 : * 

pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 



1 


640 


93 


4 


2 


640 


93 


4 


3 


640 


93 


4 


4 


640 


93 


4 



315 2 A40057 

3 90 1 WFHU2 

390 2 JC4023 

390 2 A26960 



transforming growt 
transforming growt 
transforming growt 
transforming growt 



5 


640 


93 . 


4 


390 


2 


A27512 


transforming growt 


6 


640 


93 . 


4 


390 


2 


146463 


transforming growt 


7 


640 


93 . 


4 


391 


2 


S01413 


transforming growt 


8 


637 


93 . 


o 


390 


1 


WFMS2 


trans forming growt 


9 


637 


93 . 


o 


390 


2 


S10219 


transforming growt 


10 


625 


91 . 


2 


130 


2 


148196 


transforming growt 


11 


542 


79 . 


1 


373 


2 


A41918 


transforming growt 


12 


505 


73 . 


7 


382 


2 


B61036 


transforming growt 


13 


502 


73 . 


3 


410 


2 


A41397 


transforming growt 


14 


502 


73 . 


3 


412 


2 


A36169 


transforming growt 


15 


501 


73 . 


, 1 


410 


2 


A55706 


transforming growt 


16 


499 


72 . 


. 8 


412 


2 


A34939 


transforming growt 


17 


495 


72 . 


3 


409 


2 


S01825 


trans forming growt 


1 ft 

J. o 


T J X 


7 1 . 


7 


4 12 


2 


A3 948 9 


transforming growt 


1 9 


484 


70 . 


7 


414 


1 


WFMKB2 


transforming growt 


20 


484 


70 . 


. 7 


4 14 


2 


A31249 


trans forming growt 


21 


484 


70 . 


, 7 


442 


2 


B31249 


transforming growt 


22 


482 


70 . 


. 4 


112 


2 


A61439 


transforming growt 


23 


482 


70 . 


, 4 


414 


1 


WFMSB2 


trans forming growt 


24 


480 


70 . 


, 1 


413 


1 


WFXLB2 


transforming growt 


25 


204 


29 , 


. 8 


425 


2 


147072 


inhibin beta -A cha 


26 


200 


29 


. 2 


424 


1 


WFPGBA 


inhibin beta-A cha 


27 


200 


29 


. 2 


424 


1 


S31440 


inhibin beta-A cha 


28 


200 


29 


. 2 


425 


1 


S50898 


inhibin beta-A cha 


29 


200 


29 


. 2 


426 


1 


B24248 


inhibin beta-A cha 


30 


196 


28 


. 6 


424 


1 


B40905 


inhibin beta-A cha 


31 


185 


27 


. 0 


350 


2 


JC5241 


activin beta E cha 


32 


181 


26 


. 4 


413 


2 


JC4862 


activin beta-A cha 


33 


179 


26 


. l 


115 


2 


PN0504 


activin beta A cha 


34 


177 


25 


. 8 


455 


2 


A43918 


TGF-beta-related p 


35 


174 


25 


. 4 


207 


2 


S37618 


vgr protein - rat 


36 


174 


25 


. 4 


367 


2 


JC4151 


activin beta D cha 


37 


174 


25 


. 4 


513 


1 


BMHU6 


bone morphogenetic 


38 


173 


25 


. 3 


510 


2 


A54798 


Vg-1 -related prote 


39 


171 


25 


. 0 


43 


2 


B26356 


transforming growt 


40 


169 


24 


. 7 


352 


2 


JC2466 


inhibin beta-C cha 


41 


169 


24 


. 7 


372 


2 


C39364 


GDF-1 embryonic gr 


42 


168 . 5 


24 


. 6 


360 


2 


A29619 


Vgl embryonic grow 


43 


167 . 5 


24 


. 5 


370 


2 


151199 


activin beta B sub 


44 


167.5 


24 


. 5 


430 


2 


JQ1184 


osteogenic protein 


45 


166.5 


24 


.3 


313 


2 


151284 


bone morphogenetic 



ALIGNMENTS 



RESULT 1 
A40057 

transforming growth factor beta-1 precursor - bovine (fragment) 

N;Alternate names: beta-TGF; cart ilage- inducing factor -A; EGF-dependent TGF or 

dEGF; MGF-b; milk growth factor b; TGF-type II 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 28-Feb-1992 #sequence_revision 28-Feb-1992 #text_change 16-Jul-1999 
C;Accession: A40057; A42320; A05284; A24322; B61439 

R ; Van Obberghen- Schilling, E. ; Kondaiah, P.; Ludwig, R.L.; Sporn, M.B.; Baker, 
C.C. 

Mol. Endocrinol. 1, 693-698, 1987 



A; Title: Complementary deoxyribonucleic acid cloning of bovine transforming 
growth factor-betal . 

A/Reference number: A40057; MUID : 91042552 ; PMID:3153459 
A; Accession : A4 0 057 
A; Molecule type: mRNA 
A/Residues : 1-315 <VAN> 

A; Cross-references: GB:M36271; NID: gl63747 ; PIDN : AAA30778 . 1 ; PID:gl63748 
R;Ogawa, Y. ; Schmidt, D.K.; Dasch, J.R.; Chang, R.J.; Glaser, C.B. 
J . Biol. Chem. 267, 2325-2328, 1992 

A;Title: Purification and characterization of transforming growth f actor-beta2 . 3 

and -betal.2 heterodimers from bovine bone. 

A;Reference number: A42320; MUID: 92129307 ; PMID: 1733936 

A /Accession : A4 2 320 

A;Molecule type: protein 

A;Residues: 204 -209 , 'X 211-217 <OGA> 

R;Roberts, A.B.; Anzano, M.A. ; Meyers, C.A. ; Wideman, J.; Blacher, R. ; Pan, 
Y.C.E.; Stein, S.; Lehrman, S.R.; Smith, J.M. ; Lamb, L.C.; Sporn, M.B. 
Biochemistry 22, 5692-5698, 1983 

A; Title: Purification and properties of a type beta transforming growth factor 
from bovine kidney. 

A/Reference number: A05284; MUID : 84 104793 ; PMID:6607069 
A, -Access ion ; AO 52 84 
A;Molecule type: protein 
A;Residues: 204-218 <ROB> 

R;Seyedin, S.M.; Thompson, A.Y.; Bentz, H. ; Rosen, D.M.; McPherson, J.M. ; Conti, 
A.; Siegel, N.R.; Galluppi, G.R.; Piez, K.A. 
J. Biol. Chem. 261, 5693-5695, 1986 

A;Title: Cart ilage- inducing factor-A. Apparent identity to transforming growth 
factor-beta . 

A;Reference number: A24322; MUID : 86195954 ; PMID: 3754555 
A;Accession: A24322 
A;Molecule type: protein 
A;Residues: 204-233 <SEY> 

R;Jin, Y.; Cox, D.A.; Knecht, R.; Raschdorf, F. ; Cerletti, N. 
J. Protein Chem. 10, 565-575, 1991 

A;Title: Separation, purification, and sequence identification of TGF-betal and 
TGF-beta2 from bovine milk. 

A,Reference number: A61439; MUID: 92189724 ; PMID:1799413 
A;Accession : B61439 
A;Molecule type: protein 

A;ResidueS: 204-209, 'X' ,211-217, 'XX' ,220-232 <JIN> 

C; Comment: This polypeptide is composed of two polypeptide chains cross-linked 
by disulfide bonds. It has been found in neoplastic and non-neoplastic tissues. 
C; Comment: Type II TGF does not bind to the EGF receptor and lacks intrinsic 
mitogenic activity, but in soft agar, it reacts synergist ically with either type 
I TFG or EGF, and induces cell proliferation. Cells grown in monolayer do not 
respond in a similar manner to these growth factors, but morphologically do 
acquire a transformed phenotype. 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; heterodimer 

F;204-315/Product : transforming growth factor beta-1 #status experimental <MAT> 
F; 7 , 61 , 101/Binding site: carbohydrate (Asn) (covalent) #status predicted 



Query Match 93.4%; Score 640; DB 2; Length 315; 

Best Local Similarity 99.1%; Pred. No. 9.4e-60; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I 1 I I I I I i I I I I I I I I I ! I I I ! : I I I ^ I I . I I I I ' Ill 

Db 2 03 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 2 62 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

i . 1 1 1 1 1 1 1 1 r i , 1 1 1 1 ii i ill ii 1 1 ii 1 1 1 1 ii i ill ii i m 1 1 ii 1 1 1 1 

Db 263 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 2 
WFHU2 

transforming growth factor beta-1 precursor [validated] - human 
N;Alternate names: growth- inhibitory factor; TGF type 2; TGF-beta 
C; Species: Homo sapiens (man) 

C;Date: 28-Feb-1986 #sequence_revision 19-Oct-1995 #text__change 08-Dec-2000 
C;Accession: A27513; A01395; A22290; 159664; S53444 
R;Derynck, R.; Rhee, L. ; Chen, E.Y.; Van Tilburg, A. 
Nucleic Acids Res. 15, 3188-3189, 1987 

A; Title: Intron-exon structure of the human transforming growth factor-beta 
precursor gene. 

A;Reference number: A27513; MUID: 87174845 ; PMID:3470709 
A; Access ion : A27513 
A; Molecule type: DNA 
A;Residues: 1-390 <DER> 

A; Cross-references: GB:X05839; GB:Y00112; NID:g37097 ; PIDN: CAA29283 . 1 ; 
PID:gl212989 

R;Derynck, R.; Jarrett, J. A.; Chen, E.Y.; Eaton, D.H.; Bell, J.R.; Assoian, 
R.K.; Roberts, A.B.; Sporn, M.B. ; Goeddel, D.V. 
Nature 316, 701-705, 1985 

A/Title : Human transforming growth factor-beta complementary DNA sequence and 
expression in normal and transformed cells. 
A;Reference number: A01395; MUID: 85296301 ; PMID:3861940 
A; Access ion : AO 13 95 
A; Molecule type: mRNA 

A; Residues: 1-9, ' P' , 11-24, ' P ' ,26-159, 'R' , 160-3 9 0 <DE2> 

A; Cross-references: GB:X02812; GB:J05114; NID:g37092; PIDN : CAA26580 . 1 ; 
PID:g37093 

A; Note: the authors suggest that residues 8-23 could represent the hydrophobic 

core of an amino-terminal signal peptide 

R;Massague, J.; Like, B . 

J . Biol. Chem. 260, 2636-2645, 1985 

A; Title: Cellular receptors for type beta transforming growth factor. Ligand 

binding and affinity labeling in human and rodent cell lines. 

A;Reference number: A22290; MUID: 85131019 ; PMID:2982829 

A;Accession: A22290 

A;Molecule type: protein 

A;Residues: 279-295 , * XX ' , 298 -301 <MAS> 

R;Urushizaki , Y. ; Niitsu, Y.; Terui, T. ; Koshida, Y.; Mahara, K. ; Kongo, Y. ; 
Urushizaki, I.; Takahashi, Y. ; I to, H. 
Tumor Res. 22, 41-55, 1987 

A; Title: Cloning and expression of the gene for human transforming growth 
factor-beta in Escherichia coli. 
A; Reference number: 15 9664 
A; Accession: 159664 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 279-390 <RES> 



A; Cross-references: GB:M38449; NID:g339557; PIDN: AAA36735 . 1 ; PID:g339558 
R;Stam, K. ; Stewart, A. A. ; Qu, G.Y.; Iwata, K.K.; Fenyoe, D. ; Chait, B.T.; 
Marshak, D.R.; Haley, J.D. 
Biochem. J. 305, 87-92, 1995 

A;Title: Physical and biological characterization of a growth- inhibitory 

activity purified from the neuroepithelioma cell line A673 . 

A;Reference number: S53444; MUID : 95126934 ; PMID:7826358 

A; Access ion : S534 44 

A; Status : preliminary 

A;Molecule type: protein 

A;ResidueS: 279-297 <STA> 

C; Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Genetics : 

A; Gene: GDB : TGFB1 ; TGFB 

A; Cross-references : GDB : 12 072 9 ; OMIM: 190180 
A ; Map position: 19ql3 . 2 - 19ql3 . 2 
C/Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; homodimer; mitogen; transformation 
F ; 1 - 18 /Domain : signal sequence #status predicted <SIG> 
F; 19 -2 78 /Domain : propeptide #status predicted <PRO> 
F ; 244-246/Region: cell attachment (R-G-D) motif 

F;279-390/Product : transforming growth factor beta-1 #status experimental <MAT> 
F;82, 136, 176/Binding site: carbohydrate (Asn) (covalent) ^status predicted 

Query Match 93.4%; Score 640; DB 1; Length 390; 

Best Local Similarity 99.1%; Pred. No. 1.2e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

= 1 II I III I II I III I II I II 1 1 II I III I III Ml II 1 1 Ml I II 1 1 Ml I II I II I II 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHN PGASAAPCCVPQALE PLP I VYYVGRKP KVEQLSNM I VRS CKCS 120 

M I II II I I I I I II I II I I I I I I I I M II I II I I I I I I I II I I I II I I I I I II 
Db 338 KVLALYNQHN PGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 3 90 



RESULT 3 
JC4023 

transforming growth factor beta-1 - dog 
C; Species: Canis lupus familiaris (dog) 

C;Date: 13-Jun-1995 #sequence_revision 14-Jul-1995 #text_change 24-Nov-1999 
C; Access ion: JC4 023 

R; Manning, A.M.; Auchampach, J. A.; Drong, R.F.; Slightom, J.L. 
Gene 155, 307-308, 1995 

A; Title: Cloning of a canine cDNA homologous to the human transforming growth 
factor-beta 1-encoding gene. 

A;Reference number: JC4023; MUID: 95237630; PMID:7721110 
A; Accession : JC4 023 
A; Molecule type: mRNA 
A; Residues: 1-390 <MAN> 

A; Cross-references : GB:L34956; NID:g516071; PIDN : AAA51458 . 1 ; PID:g516072 

C; Comment: This factor plays a multifunctional role as a regulator of mammalian 

cell growth and as a modulator of immune responses. 

C;Genetics : 



A;Gene: tgf-betal 

C; Super family: inhibin 

C;Keywords: growth factor; transforming protein 

F;288-390/Product : transforming growth factor beta 1 #status predicted <MAT> 

Query Match 93.4%; Score 64 0; DB 2; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 1.2e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I II 
Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVTjALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 12 0 

, 1 1 1 1 1 1 1 1 ' I : : 1 1 1 1 I III 1 1 : 1 1 i ! ! I 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 4 
A26960 

transforming growth factor beta-1 precursor - green monkey 
C; Species: Cercopithecus aethiops (green monkey, grivet) 

C;Date: 05-Oct-1988 #sequence__revis ion 05-Oct~1988 #text_change 24-Nov-1999 
C;Accession: A26960 

R;Sharples, K. ; Plowman, G.D.; Rose, T.M. ; Twardzik, D.R.; Purchio, A.F. 
DNA 6, 239-244, 1987 

A;Title: Cloning and sequence analysis of simian transforming growth factor-beta 
cDNA . 

A;Reference number: A26960; MUID : 87246074 ; PMID:3474130 
A; Access ion: A2 6 960 
A; Molecule type: mRNA 
A;Residues: 1-390 <SHA> 

A; Cross-references : GB:M16658; NID:gl76552; PIDN : AAA35369 . 1 ; PID:gl76553 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F ; 1 - 1 6 /Domain : signal sequence #status predicted <SIG> 

F; 17-390/Product : transforming growth factor beta #status predicted <MAT> 

Query Match 93.4%; Score 64 0; DB 2; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 1.2e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

^ 1 1 !i 1 1 1 1 1 1 : i ii 1 1 1 ; i 1 1 ii 1 1 1 1 1 1 1 1 1 :i ii mi 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 in 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

IIIIIIIIIIIIIIIIMIIIIIMIIII II I'll! I IIIIIIIIM 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 5 
A27512 

transforming growth factor beta-1 precursor - pig 
N;Alternate names: TGF-beta 

C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 05-Jun-1988 #sequence_revision 05-Jun-1988 #text_change 16-Jul-1999 



C;Accession: A27512; A26356; 146657 

R;Derynck, R . ; Rhee, L. 

Nucleic Acids Res. 15, 3187, 1987 

A;Title: Sequence of the porcine transforming growth factor-beta precursor. 
A/Reference number: A27512; MUID : 87174844 ; PM1D : 3470708 
A; Accession: A2 7512 
A; Molecule type: mRNA 
A;Residues: 1-390 <DER> 

R;Cheifetz, S.; Weatherbee, J.A. ; Tsang, M.L.S.; Anderson, J.K.; Mole, J.E.; 
Lucas, R.; Massague, J. 
Cell 48, 409-415, 1987 

A;Title: The transforming growth factor-beta system, a complex pattern of cross- 
reactive ligands and receptors. 

A;Reference number: A90890; MUID:87102890; PMID:2879635 
A; Accession: A2 63 56 
A;Molecule type: protein 
A;Residues: 279-322 <CHE> 

R;Kondaiah, P.; Van Obberghen- Schilling, E . ; Ludwig, R.L.; Dhar, R. ; Sporn, 
M.B. ; Roberts, A . B . 

J. Biol. Chem. 263, 18313-18317, 1988 

A;Title: cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 
Evidence for alternate splicing and polyadenylation . 
A;Reference number: 146657; MUID : 89054 010 ; PMID:2461367 
A; Accession: 14 6657 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-3 90 <KON> 

A; Cross-references : GB:M23703; NID:g755044; PIDN : AAA64 616 . 1 ; PID:g755045 
C;Genetics : 

A; Gene: TGFB ; TGF-beta-1 
C; Superf amily : inhibin 
C; Keywords: growth factor 



Query Match 93 . 4%; 

Best Local Similarity 99.1%; 
Matches 112; Conservative 



Score 64 0; DB 2; Length 3 90; 
Pred. No. 1.2e-59; 
1; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 



Db 



8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

UIIIIIIIIIMIIIIIMIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 

278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 



Qy 

Db 



68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 1 ; i ii i ii 1 1 1 1 :i ' 1 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 r.i i ii 1 1 1 1 1 1 1 ii 1 1 

338 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 6 
146463 

transforming growth factor beta-1 - sheep 

C; Species: Ovis orientalis aries, Ovis ammon aries (domestic sheep) 

C;Date: 19-Dec-1997 #sequence_revision 19-Dec-1997 #text__change 24-Nov-1999 

C;Accession: 146463; S45115 

R;Woodall, C.J.; McLaren, L.J.; Watt, N.J. 

Gene 150, 371-373, 1994 

A; Title: Sequence and chromosomal localisation of the gene encoding ovine latent 
transforming growth factor-beta 1. 

A; Reference number: 146463; MUID : 95121932 ; PMID:7821809 



A /Accession: 14 64 63 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A/Residues: 1-390 <WOO> 

A; Cross -references : EMBL : X76 916 ; NID : g496648 ; PIDN : CAA54242 . 1 ; PID:g496649 
A;Note: submitted to the EMBL Data Library, December 1993 
C; Superf amily : inhibin 

Query Match 93.4%; Score 64 0; DB 2; Length 390; 

Best Local Similarity 99.1%; Pred. No. 1.2e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

• ' II 1 . 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1! ! 1 1 II I M II 1 , 1 1 1 1 hi II , 1 1 1 M 1 1 1 1 ! II I II 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII1IIIIIIIIIIIIIIIIIM 
Db 338 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 7 
S01413 

transforming growth factor beta-1 precursor - chicken 
C;Species: Gallus gallus (chicken) 

C;Date: 30-Jun-1989 #sequence_revision 30-Jun-1989 #text_change 24-Nov-1999 
C; Access ion: SO 14 13 

R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A.B. 
Nucleic Acids Res. 16, 8730, 1988 

A; Title: Nucleotide sequence of chicken transforming growth factor-beta 1 (TGF- 
beta 1) . 

A/Reference number: S01413; MUID: 88335639; PMID:3166520 
A; Access ion: SO 14 13 
A; Molecule type: DNA 
A;Residues: 1-391 <JAK> 

A;Cross-references: EMBL:X12373; NID:g63808; PIDN : CAA3 0933 . 1 ; PID:g63809 
C; Superf amily : inhibin 
C; Keywords : growth factor 

Query Match 93.4%; Score 640; DB 2; Length 391; 

Best Local Similarity 99.1%; Pred. No. 1.2e-59; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: IT I I M I I I ' I I I I I I I I II I II I I I il I I I I I I I I I II I II I II I I I I I II I I I I I I 
Db 279 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 338 

Qy 68 KVLALYNQHNPGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 120 

IIIIM IIII IIMIMI MIMII II III Ml INIIM 

Db 33 9 KVLALYNQHNPGASAA PCCVPQALE PLP I VY Y VGRKP KVEQLSNM I VRS CKCS 3 91 



RESULT 8 
WFMS2 

transforming growth factor beta-1 precursor - mouse 
N;Alternate names: TGF type 2; TGF-beta 
C; Species: Mus musculus (house mouse) 



C;Date.- 04-Dec-1986 #sequence_revision 04-Dec-1986 #text__change 24-Nov-1999 
C;Accession: A01396 

R;Derynck, R . ; Jarrett, J. A. ; Chen, E.Y.; Goeddel , D.V. 
J. Biol. Chem. 261, 4377-4379, 1986 

A; Title : The murine transforming growth factor-beta precursor. 
A/Reference number: A01396; MUID : 86168 12 9 ; PMID:3007454 
A /Access ion: AO 13 96 
A /Molecule type: mRNA 
A/Residues : 1-390 <DER> 

A/Cross-references: GB:M13177; NID:g201952; PIDN: AAA40423 . 1 ; PID:g201953 

A; Note: the authors suggest that residues 8-23 could represent the hydrophobic 

core of an amino- terminal signal peptide 

C; Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide/ the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Superfamily : inhibin 

C; Keywords: glycoprotein/ growth factor/ growth regulation/ homodimer/ mitogen; 
transformation 

F ; 1 -23 /Domain : signal sequence #status predicted <SIG> 
F/24-278/Domain: propeptide #status predicted <PRO> 
F/244-246/Region: cell attachment (R-G-D) motif 

F;279-390/Product : transforming growth factor beta-1 #status predicted <MAT> 
F/82, 136, 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 93.0%; Score 637/ DB 1; Length 3 90/ 

Best Local Similarity 98.2%; Pred. No. 2.4e-59; 

Matches 111; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCWQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 67 

:> M 1 1 ! I II 1 1 1 1 1! I II 1 1 1 1 1 il 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 II 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 12 0 

IIIIIIIIMI Ihll llllllll llllllll!IIIIIMIIIIIi:r 

Db 338 KVLALYNQHNPGASASPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 9 
S10219 

transforming growth factor beta-1 precursor - rat 
N;Alternate names: TGF type 2; TGF-beta 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 12-Feb-1993 #sequence_revision 12-Feb-1993 #text_change 24-Nov-1999 
C;Accession: S10219; PT0023; S02267 

R;Qian, S.W.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Nucleic Acids Res. 18, 3059, 1990 

A;Title: cDNA cloning by PCR of rat transforming growth factor beta-1. 
A; Reference number: S10219; MUID : 90272425 ; PMID:2349108 
A; Access ion: SI 02 19 
A; Molecule type: mRNA 
A;Residues: 1-390 <QIA> 

A; Cross-references: EMBL:X52498; NID:g57341; PIDN : CAA3 674 1 . 1 / PID:g57342 
R;Okada, F. ; Yamaguchi, K. / Ichihara, A.; Nakamura, T. 
J. Biochem. 106, 304-310, 1989 

A/Title: Purification and structural analysis of a latent form of transforming 
growth factor-beta from rat platelets. 

A/Reference number: PT0023/ MUID : 90036779 / PMID:2478527 



A; Accession : PT0023 

A; Molecule type : protein 

A;Residues: 30-32 , ' X \ 34 -38 ,' Q 1 , 40-42 , ' X 44 <OKA> 
R;Okada, F . ; Yamaguchi, K. / Ichihara, A.; Nakamura, T. 
FEBS Lett. 242, 240-244, 1989 

A/Title: One of two subunits of masking protein in latent TGF-beta is a part of 
pro-TGF-beta . 

A;Reference number: S02267; MUID : 8 912 1 078 ; PMID:2914605 
A /Access ion: S02267 
A;Molecule type: protein 

A;Residues: 30-32 , 1 X », 34 -38 , ' Q \ 40-42 , 'X 44 <OK2> 
C; Superf amily : inhibin 

C;Keywords: glycoprotein; growth factor; integrin binding 
F ; 1-29/ Doma in : signal sequence #status predicted <SIG> 
F;30-278/Domam: propeptide #status experimental <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F;279-390/Product : transforming growth factor beta-1 #status predicted <MAT> 
F; 82 , 136 , 176/Bmding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 93.0%; Score 637; DB 2; Length 390; 

Best Local Similarity 98.2%; Pred . No. 2.4e-59; 

Matches 111; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I II I I I I I I 1 I I 
Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 120 

I I I I I I I I I I II I I I : I I I II I I I I I I I I I II I I I I I I I I I I I I II I I I II I I 
Db 338 KVLALYNQHN PGASAS PCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 10 
148196 

transforming growth factor beta-1 precursor - golden hamster (fragment) 
C; Species: Mesocricetus auratus (golden hamster) 

C;Date: 02-Jul-1996 #sequence_revis ion 04-Oct-1996 #text_change 24-Nov-1999 
C; Accession : 14 8196 

R;Wong, D.T.; Donoff, R.B.; Yang, J.; Song, B.Z.; Matossian, K. ; Nagura , N . ; 
Elovic, A.; McBride, J.; Gallagher, G. ; Todd, R.; Chiang, T. ; Chou, L.S.S.; 
Yung, CM. ; Galli, S.J.; Weller, P.F. 
Am. J. Pathol. 143, 130-142, 1993 

A; Title: Sequential expression of transforming growth factors alpha and beta 1 
by eosinophils during cutaneous wound healing in the hamster. 
A;Reference number: 148196; MUID: 93304479 ; PMID: 8317544 
A; Accession : 148196 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A;Residues: 1-130 <RES> 

A; Cross- references : EMBL:X60296; NID:g396177; PIDN : CAA42838 . 1 ; PID:g396178 
C ; Superf amily : inhibin 

Query Match 91.2%; Score 62 5; DB 2; Length 13 0; 

Best Local Similarity 97.3%; Pred. No. 1.4e-58; 

Matches 110; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 



Qy 



8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 



MINI NllllllhlMIII Illllllllllllll llllillllMIIMIMI 

Db 18 RALDTNYCFSSTEKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPY I WSLDTQYS 77 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

Illllllllllllll I I I I II I I I I I II I I I I I II I I I I I I I I I I I I I III 
Db 78 KVLALYNQHNPGASAGPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSYKCS 130 



RESULT 11 
A41918 

transforming growth factor beta-4 precursor - chicken (fragment) 

N; Alternate names: TGF-beta 4 

C; Species: Gallus gallus (chicken) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text__change 16-Jul-1999 

C;Accession: A41918; A34941; S03110 

R;Burt, D.W.; Jakowlew, S.B. 

Mol. Endocrinol. 6, 989-992, 1992 

A;Title: Correction: a new interpretation of a chicken transforming growth 
factor-beta 4 complementary DNA . 

A;Reference number: A41918; MUID: 92357039 ; PMID: 1353860 
A /Access ion: A41918 
A; Molecule type: mRNA 
A;Residues: 1-373 <BUR> 

A; Cross-references: GB:M31160; GB:X08012; GB:S41706; NID : gl2 624 37 ; 
PIDN:AAB05637.1; PID:gl262438 

A;Note: sequence extracted from NCBI backbone (NCBIN: 110186, NCBIP : 110187) 

A; Note: this report corrects and reinterprets the sequence from reference A34 941 

R;Jakowlew, S.B.; Dillard, P.J. ; Sporn, M.B. ; Roberts, A.B. 

Mol. Endocrinol. 2, 1186-1195, 1988 

A; Title: Complementary deoxyribonucleic acid cloning of a messenger ribonucleic 
acid encoding transforming growth factor beta-4 from chicken embryo 
chondrocytes . 

A/Reference number: A34941; MUID : 89112198 ; PMID:2464131 
A /Access ion : A3 4 941 
A;Molecule type: mRNA 

A;Residues: 1 MDPMS I GPKSCGGS PWRP PGTAPWS I GSRRATASSSCSTSSRVRAEVGGRAL 1 , 122- 

209, 'D' ,211-373 <JAK> 

A; Cross-references : EMBL:X08 012 

A;Note: this sequence has been corrected in A41918 
C; Super family : inhibin 

C; Keywords: glycoprotein; growth factor 

F;l/Domain: signal sequence (fragment) #status predicted <SIG> 
F;223-225/Region: cell attachment (R-G-D) motif 

F;260-373/Product : transforming growth factor beta-4 #status predicted <MAT> 
F;54, 109, 153/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 79.1%; Score 542; DB 2; Length 373; 

Best Local Similarity 82.3%; Pred. No. 2.3e-49; 

Matches 93; Conservative 9; Mismatches 9; Indels 2; Gaps 1; 
Qy 10 LDTNYCF- -SSTEKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPY I WSLDTQYS 67 

Ilhlll : I MM II MINIMI I II I II MM I Ilhlll II I II II I h 

Db 261 LDTDYCFGPGTDEKNCCVRPLYIDFRKDLQWKWIHEPKGYMANFCMGPCPYIWSADTQYT 320 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

1 1 II 1 1 Mill III III Mill hlllhlMII Mlllllhlhllll 

Db 321 KVLALYNQHNPGASAAPCCVPQTLDPLPI I YYVGRNVRVEQLSNMWRACKCS 373 



RESULT 12 
B61036 

transforming growth factor beta-5 precursor - African clawed frog 
C; Species: Xenopus laevis (African clawed frog) 

C;Date: 31-Dec-1993 #sequence_revis ion 03-Feb-1994 #text_change 16-Jul-1999 
C;Accession: A34929; B61036 

R;Kondaiah, P.; Sands, M.J.; Smith, J.M. ; Fields, A.; Roberts, A.B.; Sporn, 
M.B. ; Melton, D . A . 

J. Biol. Chem. 265, 1089-1093, 1990 

A;Title: Identification of a novel transforming growth factor-beta (TGF-beta5) 
mRNA in Xenopus laevis . 

A;Reference number: A34929; MUID : 90110090 ; PMID: 2295601 
A;Accession: A34929 
A;Molecule type: mRNA 
A;Residues: 1-382 <K0N> 

A; Cross -references : GB:J05180; NID:g214821; PIDN : AAA4 9968 . 1 ; PID:g214822 
R;Roberts, A.B.; Rosa, F.; Roche, N.S.; Coligan, J.E.; Garfield, M. ; Rebbert , 
M.L.; Kondaiah, P.; Danielpour, D. ; Kehrl , J.H. ; Wahl , S.M.; Dawid, I.B.; Sporn, 
M.B. 

Growth Factors 2, 135-147, 1990 

A;Title: Isolation and characterization of TGF-beta2 and TGF-beta5 from medium 
conditioned by Xenopus XTC cells. 

A;Reference number: A61036; MUID: 90253806; PMID:2340184 
A /Accession: B61036 
A;Molecule type: protein 

A;Residues: 271-276, »X' , 278-284 , 'XX* , 287-299 <ROB> 
C; Superf amily : inhibin 
C; Keywords: growth factor 

F;271-382/Product : transforming growth factor beta-5 #status experimental <MAT> 

Query Match 73.7%; Score 505; DB 2; Length 382; 

Best Local Similarity 74.3%; Pred . No. 1.9e-45; 

Matches 84; Conservative 12; Mismatches 17; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: = III : I I I I : I I h I I I I I I I I I I I I I I I I I h I I I I I I I I h I I I I I 
Db 270 RGVGQEYCFGNNGPNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYS 329 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

Ilhlllhlllll :||| II lllllhlllll IIMIIIhllll II 
Db 330 KVLSLYNQNNPGASISPCCVPDVLEPLPIIYYVGRTAKVEQLSNMWRSCNCS 382 



RESULT 13 
A41397 

transforming growth factor beta-3 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 03-Apr-1992 #sequence_revision 03-Apr-1992 #text_change 16-Jul-1999 
C;Accession: A41397; A61039; A61225 

R;Miller, D.A.; Lee, A.; Matsui, Y.; Chen, E.Y.; Moses, H.L.; Derynck, R. 
Mol. Endocrinol. 3, 1926-1934, 1989 

A;Title: Complementary DNA cloning of the murine transforming growth factor- 
beta3 (TGFbeta3) precursor and the comparative expression of TGFbeta3 and 
TGFbetal messenger RNA in murine embryos and adult tissues. 
A;Reference number: A41397; MUID : 90190650 ; PMID:2628730 



A; Accession : A413 97 
A; Molecule type: mRNA 
A;Residues: 1-410 <MIL> 

A; Cross-references: GB:M32745; NID:g201949; PIDN : AAA4 0422 . 1 ; PID:g201950 
R;Denhez, F. ; Lafyatis, R. ; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Growth Factors 3, 139-146, 1990 

A; Title: Cloning by polymerase chain reaction of a new mouse TGF-beta, mTGF- 
beta3 . 

A;Reference number: A61039; MUID : 91000714 ; PMID:2206556 
A; Access ion: A6103 9 
A /Molecule type: mRNA 
A;Residues: 1-410 <DEN> 

R;Watrm, F . ; Scotto, L. ; Assoian, R.K.; Wolgemuth, D.J. 
Cell Growth Differ. 2, 77-83, 1991 

A; Title: Cell lineage specificity of expression of the murine transforming 

growth factor beta-3 and transforming growth factor beta-1 genes. 

A/Reference number: A61225; MUID : 91299576 ; PMID;2069871 

A /Accession: A6122 5 

A; Status: translation not shown 

A /Molecule type: mRNA 

A;Residues: 285-410 <WAT> 

C; Super family: inhibm 

C;Keywords: glycoprotein; growth factor; growth regulation 
F; 1-21 /Domain : signal sequence #status predicted <SIG> 
F; 22-298/ Doma in : propeptide #status predicted <PR0> 
F;259-261/Region: cell attachment (R-G-D) motif 

F;299-410/Product : transforming growth factor beta-3 #status predicted <MAT> 
F; 72 , 133 , 140/Binding site: carbohydrate (Asn) (covalent) tfstatus predicted 

Query Match 73.3%; Score 5 02; DB 2; Length 410; 

Best Local Similarity 76.1%; Pred. No. 4.2e-45; 

Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

■111111 : hlllll lllllhlllllhlllllhllll llllh I II 
Db 2 98 RALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADTTHS 3 57 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

II III II llhllllll I I II hlllll llllllllhhillll 
Db 3 58 TVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 



RESULT 14 
A36169 

transforming growth factor beta-3 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 14-Dec-1990 #sequence__revision 14-Dec-1990 #text_change 20-Jun-2000 
C;Accession: A36169; A41262; S01824 

R;ten Dijke, P.; Hansen, P.; Iwata, K.K.; Pieler, C. ; Foulkes, J.G. 
Proc. Natl. Acad. Sci. U.S.A. 85, 4715-4719, 1988 

A; Title: Identification of another member of the transforming growth factor type 
beta gene family. 

A;Reference number: A36169; MUID : 88263019 ; PMID: 3164476 
A;Accession: A36169 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 
A;Residues: 1-412 <TEN> 



A; Cross-references: GB:J03241; NID:g339551; PIDN : AAA61161 . 1 ; PID:g339552 
R;Arrick, B.A.; Lee, A.L.; Grendell, R.L.; Derynck, R. 
Mol. Cell. Biol. 11, 4306-4313, 1991 

A;Title: Inhibition of translation of transforming growth factor-beta3 mRNA by 
its 5' untranslated region. 

A,-Reference number: A41262; MUID: 91342629; PMID: 1875922 

A, -Access ion : A412 62 

A;Molecule type: DNA 

A /Residues: 1-4 8 <ARR> 

A; Cross-references : GB : M58524 

R;Derynck, R . ; Lindquist, P.B.; Lee, A.; Wen, D. ; Tamm, J.; Graycar, J.L. ; Rhee, 
L.; Mason, A.J.; Miller, D.A.; Coffey, R.J.; Moses, H.L.; Chen, E.Y. 
EMBO J. 7, 3737-3743, 1988 

A;Title: A new type of transforming growth factor-beta, TGF-beta3 . 
A;Reference number: S01824; MUID : 89091120 ; PMID:3208746 
A; Access ion : S01824 
A;Molecule type: mRNA 
A;Residues: 3-412 <DER> 

A; Cross -references : EMBL:X14885; NID:g37075; PIDN : CAA33 024 . 1 ; PID:gl200236 

C ; Genet ics : 

A; Gene: GDB : TGFB3 

A/Cross-references : GDB: 120437; OMIM: 190230 
A ; Map position: 14q24-14q24 

A;Introns: 118/1; 172/3; 216/1; 252/1; 309/2; 360/3 

C; Super family: inhibin 

C; Keywords: growth factor; homodimer 

F ; 1 -27/Doma in : signal sequence #status predicted <SIG> 
F;28-300/Domain: propeptide #status predicted <PRO> 

F;301-412/Product : transforming growth factor beta-3 #status predicted <MAT> 

Query Match 73.3%; Score 502; DB 2; Length 412; 

Best Local Similarity 76.1%; Pred . No. 4.2e-45; 

Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

:||llllll : MINI lllllhlllllhlllllhllll MM- I II 
Db 3 00 RALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADTTHS 359 

Qy 68 KVLALYNQHN PGASAAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 120 

II III II llhllllll I I I I Ml, II llllllllhhIIIM 
Db 360 TVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 



RESULT 15 
A55706 

transforming growth factor beta-3 precursor - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 03-Mar-1995 #sequence_revision 03-Mar-1995 #text_change 16-Jul-1999 
C;Accession: A55706; B40699; S36042 

R;Wang, J.; Kuliszewski, M. ; Yee, W. ; Sedlackova, L. ; Xu, J.; Tseu, I.; Post, M. 
J. Biol. Chem. 270, 2722-2728, 1995 

A;Title: Cloning and expression of glucocort icoid- induced genes in fetal rat 

lung fibroblasts. Transforming growth factor-beta-3 . 

A;Reference number: A55706; MUID : 95155340 ; PMID:7852342 

A;Accession: A55706 

A; Molecule type: mRNA 

A;Residues: 1-410 <WAN> 



A; Cross-references : GB:U03491 

A;Note: it is uncertain whether Met-1 is the initiator 
R;McKmnon, R.D.; Piras, G.; Ida Jr., J. A.; Dubois -Dalcq, M. 
J. Cell Biol. 121, 1397-1407, 1993 

A;Title: A role for TGF-beta in oligodendrocyte differentiation. 

A;Reference number: A40699; MU ID: 93286190; PMID: 8509457 

A; Accession : B4 06 99 

A ; Status : prel iminary 

A /Molecule type: mRNA 

A;Residues: 157-211 <MCK> 

A; Cross-references: EMBL:X71903 ; NID:g311326; PIDN : CAA50722 . 1 ; PID:g311327 
C; Super family : inhibin 

Query Match 73.1%; Score 501; DB 2; Length 410; 

Best Local Similarity 76.1%; Pred. No. 5.3e-45; 

Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: Ml I MM : MM Ml M M MM MM MM M MM Ml llllh I II M 

Db 2 98 RALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSSDTTHS 3 57 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

II Ml II M Ml II Ml MM Ml II 1 1 MIMI II IMM MM 

Db 3 58 TVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 
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Job time : 3.96533 sees 
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OM protein - protein search, using sw model 
Run on: 



Title: 
Perfect score: 685 



October 28, 2003, 00:08:41 ; Search time 2.4052 Seconds 

( wi t hout a 1 ignment s ) 
2346.251 Million cell updates/sec 

US-10-017-372E-11 



Sequence : 



1 DYKDDDDKALDTNYCFSSTE GRKPKVEQLSNMI VRSCKCS 12 0 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



127863 



Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



SwissProt 41:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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375 


1 


GDF8 SHEEP 


018830 


ovis aries 


35 


195.5 


28 


.5 


375 


1 


GDF8_CHICK 


042220 


gallus gall 


36 


195.5 


28 


. 5 


375 


1 


GDF8 HUMAN 


014793 


homo sapien 


37 


195.5 


28 


.5 


375 


1 


GDF8__MELGA 


042221 


meleagris g 


38 


195.5 


28 


.5 


375 


1 


GDF8_PIG 


018831 


sus scrota 


39 


195 .5 


28 


.5 


376 


1 


GDF8_M0USE 


008689 


mus musculu 


40 


195.5 


28 


. 5 


376 


1 


GDF8__RAT 


035312 


rattus norv 


41 


191 


27 


. 9 


424 


1 


IHBA_CHICK 


P27092 


gallus gall 


42 


190.5 


27 


. 8 


355 


1 


DVR1_BRARE 


P35621 


brachydanio 


43 


190 . 5 


27 


. 8 


375 


1 


GDF8_BOVIN 


018836 


bos taurus 


44 


188 .5 


27 


. 5 


405 


1 


GDFB__MOUSE 


Q9zlw4 


mus musculu 


45 


188 .5 


27 


. 5 


407 


1 


GDFB_HUMAN 


095390 


homo sapien 



ALIGNMENTS 



RESULT 1 
TGF1 BOVIN 



ID TGFl_BOVIN STANDARD; PRT; 315 AA. 

AC P18341; 

DT 01-NOV-1990 (Rel . 16, Created) 

DT 01-NOV-1990 (Rel. 16, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (Fragment) . 

GN TGFB1 . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Ruminant ia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91042552; PubMed=3 1534 59 ; 

RA van Obberghen- Schilling E., Kondaiah P., Ludwig R.L., Sporn M.B., 

RA Baker C.C. ; 

RT "Complementary deoxyribonucleic acid cloning of bovine transforming 

RT growth factor-beta 1 . " ; 

RL Mol . Endocrinol. 1:693-698(1987). 

RN [2] 

RP SUBUNITS. 

RC TISSUE=Bone; 

RX MEDLINE=92129307; PubMed=1733936 ; 

RA Ogawa Y. , Schmidt D.K., Dasch J.R., Chang R.J., Glaser C.B.; 

RT "Purification and characterization of transforming growth factor-beta 

RT 2.3 and -beta 1.2 heterodimers from bovine bone . " ; 

RL J. Biol. Chem. 267:2325-2328(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . Heterodimers of TGF-beta 1/2 

CC have been found in bone . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M36271; AAA30778.1; -. 

DR PIR; A40057; A40057. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 



DR 


InterPro; 


IPR003911; TGF TGFb . 


DR 


InterPro; 


IPR001839; TGFb. 




DR 


InterPro; 


IPR001111; TGFb_N. 




DR 


Pfam; PF00019; TGF-beta; 1. 




DR 


Pfam; PF00688; TGFb_propeptide; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT. 




DR 


PRINTS; PRO 1423; 


TGFB ETA . 




DR 


ProDom; PD000357; 


TGFb; 1. 




fir 

UK 


SMART; SM002 04; TGFB; 1. 




nR 

LJi\ 


PROSITE; 


PS00250; 


TGF_BETA_1 ; 


1. 




Growth factor; Mitogen; Glycoprotein. 


PT 


NON_TER 


1 


1 




PT 


PROPEP 


<1 


203 




PT 
r i 


CHAIN 


204 


315 


TRANSFORMING GROWTH FACTOR BETA 1. 


PT 


DISULFID 


210 


219 


BY SIMILARITY. 


FT 


DISULFID 


218 


281 


BY SIMILARITY. 


FT 


DISULFID 


247 


312 


BY SIMILARITY. 


FT 


DISULFID 


251 


314 


BY SIMILARITY. 


FT 


DISULFID 


280 


280 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


7 


7 


N-LINKED { GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


61 


61 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


101 


101 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


169 


171 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


315 AA 


36269 MW; 


C2717A23D994E00E CRC64; 



Query Match 93.4%; Score 640; DB 1; Length 315; 

Best Local Similarity 99.1%; Pred. No. 1.2e-61; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: ' II I I I I I I I II I I I I I I I I II I I I I I I I ! I I I M I I I I I I III I I I! I I I I I I I I I I I 
Db 203 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 262 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

. 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II I 1 1 ' 1 1 1 1 1 1 1 1 1 1 II 1 1 III 1 1 1 1 1 1 1 1 

Db 263 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 2 
TGF1_CANFA 

ID TGF1_CANFA STANDARD; PRT; 3 90 AA. 

AC P54831; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 
OX NCBI_TaxI D= 96 1 5 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Jugular vein endothelial; 

RX MEDLINE=9523 763 0; PubMed=7721 110 ; 

RA Manning A.M., Auchampach J. A. , Drong R.F., Slightom J.L.; 

RT "Cloning of a canine cDNA homologous to the human transforming growth 



RT factor-beta 1 -encoding gene ." ; 

RL Gene 155:307-308(1995). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; L34956; AAA51458.1; -. 

DR PIR; JC4023; JC4023. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF__cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 



DR 


PROSITE; 


PS00250; 


TGF BETA 1 ; 1 . 




KW 


Growth factor; Mitogen; 


Glycoprotein; Signal . 




FT 


SIGNAL 


1 


23 


POTENTIAL. 




FT 


PROPEP 


24 


278 


BY SIMILARITY. 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH 


FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 




FT 


DISULFID 


293 


356 


BY SIMILARITY. 




FT 


DISULFID 


322 


387 


BY SIMILARITY. 




FT 


DISULFID 


326 


389 


BY SIMILARITY. 




FT 


DISULFID 


355 


355 


INTERCHAIN. 




FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC . . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44185 MW; EB478 0E88B7B590E CRC64 ; 



Query Match 93.4%; Score 64 0; DB 1; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 1.5e-61; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

HMIIMIIIIIMIIIIIIIIIIMIIIIIIMIIIIMMIIMIIIIIIIIIIIII 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 



Qy 68 KVLALYNQHN PGAS AAPCCVPQALE PLP I VYYVGRKPKVEQLSNM I VRS CKCS 12 0 

IIIMIMIIIllllllllllllllllllllMIIMIIIMIIIIIIIIIII 

Db 33 8 KVIALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 3 
TGF1 CERAE 



ID TGF1_CERAE STANDARD; PRT; 3 90 AA. 

AC P09533; 

DT 01-MAR-1989 (Rel . 10, Created) 

DT 01-MAR-1989 (Rel. 10, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cercopithecus aethiops (Green monkey) (Grivet) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI JTaxID=9534 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87246074; PubMed=34 74 13 0 ; 

RA Sharpies K. , Plowman G.D., Rose T.M., Twardzik D.R., Purchio A.F.; 

RT "Cloning and sequence analysis of simian transforming growth 

RT factor-beta cDNA . 11 ; 

RL DNA 6:239-244(1987). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROL PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M16658; AAA35369.1; -. 

DR PIR; A26960; A26960. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD00 03 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 





PROSITE; 


PS00250; 


TGF BETA 1; 


1. 




Growth factor; Mitogen; Glycoprotein; Signal. 


FT 
r i 


CTPMA T 
O x vjl^M.i_l 


1 

x 


1 f, 

X, \J 


POTFNTT ATj 

x w x xjim x x nu • 


PT 

r 1 


PPPlDPD 


1 7 
x / 


£» 1 O 




PT 
r i 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


r i 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44356 MW; 


DFF63E2BAB4432 0E CRC64 ; 



Query Match 93.4%; Score 64 0; DB 1; Length 390; 

Best Local Similarity 99.1%; Pred. No. 1.5e-61; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0, 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

:| II II Mill Mill 1 1 III II II III 1 1 III II MM II I III III II III MM III 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

I Ml II Ml IMIIIMIIMII Mil: M IMII Ih lllllll Mill 1 1 

Db 338 KVTxALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 4 
TGF1_HUMAN 

ID TGF1_HUMAN STANDARD; PRT; 3 90 AA. 

AC P01137; Q9UCG4; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 OR TGFB . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCB I _Ta xID=9606; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=87174845; PubMed=3470709 ; 

RA Derynck R. , Rhee L. , Chen E.Y., van Tilburg A. ; 

RT "Intron-exon structure of the human transforming growth factor-beta 

RT precursor gene. " ; 

RL Nucleic Acids Res. 15:3188-3189(1987). 

RN [2] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=85296301; PubMed=3861940 ; 

RA Derynck R. , Jarrett J. A., Chen E.Y., Eaton D.H., Bell J.R. , 

RA Assoian R.K., Roberts A.B., Sporn M.B., Goeddel D.V. ; 

RT "Human transforming growth factor-beta complementary DNA sequence and 

RT expression in normal and transformed cells."; 

RL Nature 316:701-705(1985). 



RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Duodenum, and Eye; 

RX MEDLINE=22388257 ; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G. , 

RA Klausner R.D., Collins F.S., Wagner L . , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T. , Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M. , Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H. , 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J. , Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E . , Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A. , Young A.C., Shevchenko Y., Bouffard G.G. # 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [4] 

RP SEQUENCE OF 279-390 FROM N.A. 

RC TISSUE=Carcinoma ; 

RA Urushizaki Y. , Niitsu Y., Terui T. , Koshida Y. , Mahara K. , Kohgo Y., 

RA Urushizaki I . , Takahashi Y. , Ito H. ; 

RT "Cloning and expression of the gene for human transforming growth 

RT factor-beta in Escherichia coli."; 

RL Tumor Res. 22:41-55(1987). 

RN [5] 

RP SEQUENCE OF 279-329. 

RC TISSUE=Bladder carcinoma; 

RX MEDLINE=9322 9900; PubMed=84 7 1 84 6 ; 

RA Bourdrel L., Lin C.-H., Lauren S.L., Elmore R.H., Sugarman B.J., 

RA Hu S., WestCOtt K.R.; 

RT "Recombinant human transforming growth factor-beta 1: expression by 

RT Chinese hamster ovary cells, isolation, and characterization."; 

RL Protein Expr. Purif. 4:130-140(1993). 

RN [6] 

RP SEQUENCE OF 279-301. 

RX MEDLINE=85131019; PubMed=2 98282 9 ; 

RA Massague J., Like B.; 

RT "Cellular receptors for type beta transforming growth factor. Ligand 

RT binding and affinity labeling in human and rodent cell lines."; 

RL J. Biol. Chem. 260:2636-264 5(1985). 

RN [7] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=93144319; PubMed=8424 942 ; 

RA Archer S.J., Bax A . , Roberts A.B., Sporn M.B., Ogawa Y., Piez K.A. , 

RA Weatherbee J. A. , Tsang M.L.-S., Lucas R., Zheng B.-L., Wenker J . , 

RA Torchia D.A. ; 

RT "Transforming growth factor beta 1: NMR signal assignments of the 

RT recombinant protein expressed and isotopically enriched using Chinese 

RT hamster ovary cells."; 



RL Biochemistry 32:1152-1163(1993) . 

RN [8] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE= 93 14432 0; PubMed=8424 943 ; 

RA Archer S.J., Bax A. , Roberts A.B., Sporn M.B., Ogawa Y. , Piez K.A. , 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R . , Zheng B.-L., Wenker J., 

RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: secondary structure as determined 

RT by heteronuclear magnetic resonance spectroscopy."; 

RL Biochemistry 32:1164-1171(1993). 

RN [9] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=96266150; PubMed=8 6796 13 ; 

RA Hinck A. P., Archer S.J., Qian S.W., Roberts A.B., Sporn M.B., 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D.A. ; 

RT "Transforming growth factor beta 1: three-dimensional structure in 

RT solution and comparison with the X-ray structure of transforming 

RT growth factor beta 2 . " ; 

RL Biochemistry 35:8517-8534(1996). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide- 1 inked . 
CC SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X05839; CAA29283.1; -. 

DR EMBL; X05840; CAA29283.1; JOINED. 

DR EMBL; X05843; CAA29283.1; JOINED. 

DR EMBL; X05844; CAA29283.1; JOINED. 

DR EMBL; X05849; CAA29283.1; JOINED. 

DR EMBL; X05850; CAA29283.1; JOINED. 

DR EMBL; X02812; CAA26580.1; ALT_SEQ . 

DR EMBL; BC001180; AAH01180.1; -. 

DR EMBL; BC000125; AAH00125.1; 

DR EMBL; BC022242; AAH22242.1; 

DR EMBL; M38449; AAA36735.1; -. 

DR PIR; A27513; WFHU2 . 

DR PDB; 1KLA; 17 -AUG- 96. 

DR PDB; 1KLC; 17 -AUG- 96. 

DR PDB; 1KLD; 17-AUG-96. 

DR Genew; HGNC: 11766; TGFB1 . 

DR MIM; 190180; -. 

DR GO; GO:0006916; P : ant i -apoptos is ; TAS . 

DR InterPro; IPR002400; GF_cysknot . 



DR 

UK 


InterPro; 


IPR003911; TGF_TGFb. 


DR 

UK 


InterPro; 


IPR001839; TGFb . 




DR 


InterPro; 


IPR001111; TGFb_N . 




UK 


Pfam; PF00019; TGF-beta; 1. 




DR 

UK 


Pfam; PF00688; TGFb_propept ide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT. 




dr 


PRINTS; PR01423; 


TGFB ETA . 




nR 

UK 


ProDom ; PDO 0 0 3 5 7 ; 


TGFb; 1. 




hr 

UK 


SMART; SM002 04; TGFB ; 1. 




DR 


PROSITE; 


PS00250; 


TGF_BETA_1 ; 


1 . 


x\rV 


Growth factor; Mitogen; Glycoprotein; Signal; 3D-structure . 


RT 


S I GNAL 


1 


23 


POTENTIAL . 


RT 


PROPEP 


24 


278 




RT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1 


UT 

r i 


DISULFID 


285 


294 




DT 

r i 


DISULFID 

i — ' X i_v XJ L X I—* 


293 


356 




RT 


DISULFID 


322 


387 




RT 


DISULFID 


326 


389 




RT 
r i 


DISULFID 


355 


355 


INTERCHAIN 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


RT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


RT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


RT 


CONFLICT 


10 


10 


L -> P (IN REF. 2) . 


RT 
r i 


CONFLICT 


159 


159 


R -> RR (IN REF. 2) . 


FT 


STRAND 


281 


281 




RT 


TURN 


282 


287 




RT 
r ± 


STRAND 


294 


296 




RT 
r i 


STRAND 


300 


300 




RT 
r i 


TURN 


302 


305 




FT 


STRAND 


313 


313 




FT 


STRAND 


317 


317 




FT 


STRAND 


321 


323 




FT 


HELIX 


335 


346 




FT 


TURN 


348 


349 




FT 


STRAND 


358 


370 




FT 


TURN 


371 


372 




FT 


STRAND 


373 


387 




SQ 


SEQUENCE 


390 AA; 44341 MW; 


75391614250288FE CRC64 ; 



Query Match 93.4%; Score 64 0; DB 1; Length 390; 

Best Local Similarity 99.1%; Pred. No. 1.5e-61; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: 11 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 I II 1 1 1 1 1 1 1 MMI M 1 1 1 1 1 1 1 1 1 Ml 1 1 1 Ml II 1 1 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

Mill 1 1 i 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 i I i 1 1 1 1 1 1 1 1 1 1 1 

Db 338 KVLALYNQHN PGASAA PCCVPQALE PLP I VY YVGRKPKVEQLSNM I VRS CKCS 3 90 



RESULT 5 
TGF1_PIG 

ID TGF1_PIG STANDARD; PRT; 3 90 AA. 

AC P07200; P08832; 



DT Ol-APR-1988 (Rel . 07, Created) 

DT Ol-APR-1988 (Rel. 07, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla ; Suma; Suidae; Sus. 

OX NCBI_TaxID=9823; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC TISSUE=Ovary; 

RX MEDLINE=87174844 ; PubMed=3470708 ; 

RA Derynck R . , Rhee L . ; 

RT "Sequence of the porcine transforming growth factor-beta precursor."; 

RL Nucleic Acids Res. 15:3187-3187(1987). 

RN [2] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RC STRAIN=Miniature swine; 

RX MEDLINE=89054010; PubMed=24 61367 ; 

RA Kondaiah P., van Obberghen-Schill ing E., Ludwig R.L., Dhar R. , 

RA Sporn M.B., Roberts A.B.; 

RT "cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

RT Evidence for alternate splicing and polyadenylat ion . " ; 

RL J. Biol. Chem. 263:18313-18317(1988). 

RN [3] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RX MEDLINE=8833 563 9; PubMed=3 16652 0 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Nucleotide sequence of chicken transforming growth factor-beta 1 

RT (TGF-beta 1) 

RL Nucleic Acids Res. 16:8730-8730(1988). 

RN [4] 

RP SHOWS THAT REF . 3 SEQUENCE IS FROM PIG. 

RA Jakowlew S.B. ; 

RL Unpublished observations (MAR-1996) . 

RN [5] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RA Wimmers K. , Chomdej S., Ponsuksili S., Schellander K. ; 

RT "Polymorphism in the porcine transforming growth factor beta 1 

RT gene . 11 ; 

RL Submitted (DEC-2001) to the EMBL/GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE OF 279-322. 

RX MEDLINE=871 02890; PubMed=287 963 5 ; 

RA Cheifetz S., Weatherbee J. A. , Tsang M.L.S., Anderson J.K., Mole J.E., 

RA Lucas R. , Massague J. ; 

RT "The transforming growth factor-beta system, a complex pattern of 

RT cross-reactive ligands and receptors."; 

RL Cell 48:409-415(1987). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide- 1 inked . 



CC -!- SUBCELLULAR LOCATION : Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC -!- CAUTION: REF. 3 SEQUENCE WHICH WAS SAID TO ORIGINATE FROM CHICKEN 
CC WHITE LEGHORN, SEEMS (REF.4) TO ORIGINATE FROM PIG . 

CC 

CC This SWISS-PROT entry is copyright- It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch) . 

CC 

DR EMBL; Y00111; CAA68291.1; -. 

DR EMBL; M23703; AAA64616.1; -. 

DR EMBL; X12373; CAA30933.1; -. 

DR EMBL; AF461808; AAL57902.1; -. 

DR PIR; A27512; A27512 . 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal; Polymorphism. 



FT 


SIGNAL 


1 


23 


POTENTIAL . 




FT 


PROPEP 


24 


278 






FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH 


FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 




FT 


DISULFID 


293 


356 


BY SIMILARITY. 




FT 


DISULFID 


322 


387 


BY SIMILARITY. 




FT 


DISULFID 


326 


389 


BY SIMILARITY. 




FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N- LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N- LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 


FT 


VARIANT 


114 


114 


L -> V. 




FT 


CONFLICT 


6 


7 


LR -> PG (IN REF. 3) 




FT 


CONFLICT 


180 


180 


R -> G (IN REF. 3) . 




FT 


CONFLICT 


237 


237 


N -> NA (IN REF. 3) . 




SQ 


SEQUENCE 


390 AA; 


44294 


MW; A6E2C3659FC384E6 CRC64 ; 



Query Match 93.4%; Score 64 0; DB 1; Length 3 90; 

Best Local Similarity 99.1%; Pred. No. 1.5e-61; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: i i i ] ! 1 1 1 1 1 1 1 1 1 ' 1 1 1 1 ; 1 1 1 1 1 1 1 . : 1 1 1 1 1 1 1 1 i : i ^ 1 1 1 ! 1 1 • 1 1 1 1 1 ; I 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 



Qy 68 KVLiALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 6 
TGF1 SHEEP 



ID TGF1_SHEEP STANDARD; PRT; 390 AA . 

AC P50414; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01 -OCT- 1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Ovis aries (Sheep) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Ruminant ia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Ovis. 

OX NCBI_TaxID=9940; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95121932; PubMed=782 1 8 0 9 ; 

RA Woodall C.J., McLaren L.J., Watt N.J.; 

RT "Sequence and chromosomal localisation of the gene encoding ovine 

RT latent transforming growth factor-beta 1 . " ; 

RL Gene 150:371-373(1994). 

RN [2] 

RP SEQUENCE OF 281-390 FROM N.A. 

RC STRAIN=Merino ; TISSUE=Skin; 

RX MEDLINE=95268698; PubMed=774 962 1 ; 

RA Sutton R., Ward W.G. , Raphael K.A. , Cam G.R.; 

RT "Growth factor expression in skin during wool follicle development."; 

RL Comp. Biochem. Physiol. HOB : 697 -705 ( 1995 ) . 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul f ide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X76916; CAA54242.1; -. 

DR EMBL; L36038; AAA31526.1; -. 

DR PIR; 146463; 146463. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; IPR003 911; TGF_TGFb. 



DR InterPro; IPR001839; TGFb. 

DR InterPro ; IPR001111; TGFb__N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

POTENTIAL . 
POTENTIAL . 

TRANSFORMING GROWTH FACTOR BETA 1. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
BY SIMILARITY. 
INTERCHAIN (BY SIMILARITY) . 
N-LINKED ( GLCNAC . . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
N-LINKED (GLCNAC. . .) (POTENTIAL). 
CELL ATTACHMENT SITE (POTENTIAL) . 
MW; 1C247299484D0E57 CRC64 ; 

Score 64 0; DB 1; Length 3 90; 
Best Local Similarity 99.1%; Pred. No. 1.5e-61; 

Matches 112; Conservative 1; Mismatches 0; Indels 0; Gaps 0, 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

:| 1 1 M 1 1 1 1 i Ml 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 i I II 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCC VPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 I M 1 1 1 1 1 II 1 1 1 1 M I II 1 1 1 : 1 1 1 1 1 1 II I 1 1 

Db 338 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



FT 


SIGNAL 


1 


23 


FT 


PROPEP 


24 


278 


FT 


CHAIN 


279 


390 


FT 


DISULFID 


285 


294 


FT 


DISULFID 


293 


356 


FT 


DISULFID 


322 


387 


FT 


DISULFID 


326 


389 


FT 


DISULFID 


355 


355 


FT 


CARBOHYD 


82 


82 


FT 


CARBOHYD 


136 


136 


FT 


CARBOHYD 


176 


176 


FT 


SITE 


244 


246 


SQ 


SEQUENCE 


390 AA; 


442 


Query Match 




93 



RESULT 7 
TGFl_MOUSE 

ID TGFl_MOUSE STANDARD; PRT; 3 90 AA. 

AC P04202; 

DT 20-MAR-1987 (Rel . 04, Created) 

DT 20-MAR-1987 (Rel. 04, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID= 10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=86168129; PubMed=3007454 ; 

RA Derynck R., Jarrett J. A. , Chen E.Y., Goeddel D.V.; 

RT "The murine transforming growth factor-beta precursor."; 

RL J. Biol. Chem. 261:4377-4379(1986). 

RN [2] 



RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RX MEDLINE=96096545; PubMed=8 5222 0 0 ; 

RA Guron C. , Sudarshan C. , Raghow R.; 

RT "Molecular organization of the gene encoding murine transforming 

RT growth factor beta 1 . 11 ; 

RL Gene 165:325-326(1995). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6, and NOD/LT; TISSUE-Spleen / 

RA Poirot L., Benoist C. , Mathis D. ; 

RT "Transforming growth factor-beta 1 sequence and expression: no 

RT difference between NOD/Lt and C57B1/6 mouse strains."; 

RL Submitted (AUG-1998) to the EMBL/ GenBank/DDB J databases. 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

cc 



DR 


EMBL; 


M13177; 


AAA40423 


1, 




DR 


EMBL ; 


L42462; 


AAB00138 


1, 




DR 


EMBL; 


L42456; 


AAB00138 


1, 


JOINED 


DR 


EMBL; 


L42457; 


AAB00138 


1, 


JOINED 


DR 


EMBL; 


L42458; 


AAB00138 


1, 


JOINED 


DR 


EMBL; 


L42459; 


AAB00138 


1, 


JOINED 


DR 


EMBL; 


L42460; 


AAB00138 


1, 


JOINED 


DR 


EMBL; 


L42461; 


AAB00138 


1, 


JOINED 


DR 


EMBL; 


AJ009862; CAA08900. 


1; 


DR 


PIR; 


A01396; 


WFMS2 . 






DR 


HSSP; 


P01137; 


1KLA . 






DR 


MGD; 


MGI : 98725; Tgfbl . 







DR GO; GO: 0005578; C : extracel lular matrix; IDA. 

DR GO; GO: 0006954; P : inflammatory response; IMP. 

DR GO; GO: 0007515; P : lymph gland development; IMP. 

DR GO; GO:0008220; P:necrosis; IMP. 

DR GO; GO:0016202; P:regulation of myogenesis; IDA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; I PRO 03 911 ; TGF_TGFb. 

DR InterPro; I PRO 0183 9; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 0438; GFCYSKNOT . 

DR PRINTS; PRO 142 3; TGFB ETA . 



DR 


ProDom; PD000357; 


TGFb; 


1 . 






SMART; SM00204; TGFB; 1 






np 


PROSITE; 


PS00250; 


TGF_BETA_1 ; 


1. 




Growth factor; Mitogen; 


Glycoprotein; Signal . 


FT 


SIGNAL 


1 


23 




POTENTIAL. 


r i 


PROPEP 


24 


278 






PT 


CHAIN 


279 


390 




TRANSFORMING GROWTH FACTOR BETA 1 . 


PT 


DISULFID 


285 


294 




BY SIMILARITY. 


FT 


DISULFID 


293 


356 




BY SIMILARITY. 


FT 


DISULFID 


322 


387 




BY SIMILARITY. 


FT 


DISULFID 


326 


389 




BY SIMILARITY. 


FT 


DISULFID 


355 


355 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 




N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 




CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44310 MW; 


4381A51B711D689E CRC64 ; 


Query Match 




93 


.0%; 


Score 637; DB 1; Length 3 90; 



Best Local Similarity 98.2%; Pred. No. 3.1e-61; 

Matches 111; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: 1 1 ! 1 1 1 1 1 1 II 1 1 1 ! I ;l I II I ' 1 1: 1 1 1 1! 1 1 1 1 II Ml 1 1 1 1 1 1 1 1 M I II I I M 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 120 

1 1 1 1 1 1 1 1 1 1 II 1 1 h 1 1 ; 1 1 Ml : I M II I II 1 1 1 1 II I M 1 1 1 1 1 1 M M I 

Db 338 KVLALYNQHNPGASASPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 8 
TGF1_RAT 

ID TGF1_RAT STANDARD; PRT; 3 90 AA. 

AC P17246; 

DT 01-AUG-1990 (Rel. 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Heart ; 

RX MEDLINE=90272425; PubMed=234 9 1 08 ; 

RA Qian S.W., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT 11 cDNA cloning by PCR of rat transforming growth factor beta-1." ; 

RL Nucleic Acids Res. 18:3059-3059(1990). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE . TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 



CC -!- SUBUNIT: Homodimer; disulf ide-1 inked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X52498; CAA36741.1; -. 

DR PIR; S10219; S10219. 

DR HSSP; P01137; 1KLA. 

DR InterPro; I PRO 024 00; GF_cysknot . 

DR InterPro; I PRO 03 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta ; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 043 8; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0 0 03 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N- LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44329 MW; 


5E21108ED50D853C CRC64; 


Query Match 




93.0%; 


Score 637; DB 1; Length 3 90; 



Best Local Similarity 98.2%; Pred . No. 3.1e-61; 

Matches 111; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

M I II I! 1 1 1 ! 1 1 1 II 1 1 1 1 IN 1 1 1 1 I III 1 1 : 1 1 : II 1 1 1 1 ! 1 1 1 II 1 1 1 1 1 II I II 

Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

M 1 1 'I I . II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II I h 1 1 II I II 1 1 1 1 1 1 1 1 1 I II 1 1 

Db 338 KVLALYNQHNPGASASPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 90 



RESULT 9 
TGF1_H0RSE 

ID TGFl_HORSE STANDARD; PRT; 3 90 AA. 



AC 019011; 

DT 15-JUL-1998 (Rel . 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Equus caballus (Horse) . 

0C Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

0C Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBIJTaxID=9796; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph node; 

RX MEDLINE=9818 55 07; PubMed= 9524 8 1 9 ; 

RA Penha-Goncalves M.N., Onions D.E., Nicolson L. ; 

RT "Cloning and sequencing of equine transforming growth factor-beta 1 

RT (TGF beta-1) cDNA . " ; 

RL DNA Seq. 7:375-378(1997). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X99438; CAA67801.1; 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR Inter Pro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 043 8; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 BY SIMILARITY. 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 

FT DISULFID 326 389 BY SIMILARITY. 



FT 


DISULFID 


355 


355 


FT 


CARBOHYD 


82 


82 


FT 


CARBOHYD 


136 


136 


FT 


CARBOHYD 


176 


176 


SQ 


SEQUENCE 


390 AA; 


43974 



INTERCHAIN (BY SIMILARITY) . 
N-LINKED (GLCNAC . - - ) (POTENTIAL) . 
N-LINKED (GLCNAC . . .) (POTENTIAL). 
N-LINKED ( GLCNAC . . . ) (POTENTIAL). 
MW; A86D715F44549691 CRC64 ; 



Query Match 92.8%; Score 636; DB 1; Length 3 90; 

Best Local Similarity 98.2%; Pred. No. 4e-61; 

Matches 111; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLG'WKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 278 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 33 7 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

I I I I I I I I II I I II I I I II I I I I I I I I II I I I I II I II I I I I I II I II I I I I 
Db 338 KVLALYNQHNPGASAAPCCVPQVLEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 10 
TGFl_CAVPO 

ID TGFl_CAVPO STANDARD; PRT; 3 90 AA. 

AC Q9Z1Y6; Q9QZB3 ; Q9R148; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Hartley; 

RA Jeevan A., McMurray D.N., Yoshimura T. ; 

RT "Guinea pig transforming growth factor-beta in peritoneal exudates 

RT after BCG vaccination. " ; 

RL Submitted (OCT-1999) to the EMBL/ GenBank/ DDB J databases. 

RN [2] 

RP SEQUENCE OF 265-382 FROM N.A. 

RX MEDLINE=99144670 ; PubMed=10025978 ; 

RA Scarozza A.M., Ramsingh A.I., Wicher V., Wicher K. ; 

RT "Spontaneous cytokine gene expression in normal guinea pig blood and 

RT tissues . " ; 

RL Cytokine 10:851-859(1998). 

RN [3] 

RP SEQUENCE OF 279-371 FROM N.A. 

RL STRAIN=Hartley ; TISSUE=Trachea ; 

RA Morishima Y. , Uchida Y. , Nomura A., Ishii Y . , Sakamoto T. , 

RA Sekizawa K. ; 

RT "Guinea-pig transforming growth factor-beta expression in injured 

RT tracheal epithelium."; 

RL Submitted (JUL- 1999) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 



CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC SUBUNIT: Homodimer; disulf ide-linked (By similarity). 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformat ics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; AF191297; AAF02780.1; -. 

DR EMBL; AF097509; AAC83807.1; -. 

DR EMBL; AF169347; AAD49347.1; -. 

DR HSSP; P01137; lKLA. 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT. 




DR 


PRINTS; 


PR01423; 


TGFB ETA . 




DR 


ProDom; 


PD000357; 


TGFb; 


1. 




DR 


SMART; SM00204; TGFB; 1 






DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1. 


KW 


Growth factor; Mitogen; 


Glycoprotein; Signal . 


FT 


SIGNAL 


1 


24 




POTENTIAL. 


FT 


PROPEP 


25 


278 




POTENTIAL. 


FT 


CHAIN 


279 


390 




TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 




BY SIMILARITY. 


FT 


DISULFID 


293 


356 




BY SIMILARITY. 


FT 


DISULFID 


322 


387 




BY SIMILARITY. 


FT 


DISULFID 


326 


389 




BY SIMILARITY. 


FT 


DISULFID 


355 


355 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 




N-LINKED (GLCNAC. . .) ( POTENTIAL) 


FT 


SITE 


244 


246 




CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


279 


279 




G - > P (IN REF. 3) . 


FT 


CONFLICT 


286 


286 




F -> S (IN REF. 2) . 


FT 


CONFLICT 


309 


309 




K -> E (IN REF. 2) . 


FT 


CONFLICT 


322 


322 




C -> R (IN REF. 2) . 


FT 


CONFLICT 


350 


350 




A -> G (IN REF. 2) . 


SQ 


SEQUENCE 


390 AA; 44328 MW ; 


1539F849BA0C0FF1 CRC64 ; 


Query Match 




91 


7%; 


Score 62 8; DB 1; Length 390; 



Best Local Similarity 97.3%; Pred. No. 2.9e-60; 

Matches 110; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

= MIIIIMIIIMIMIIIIIIIIIIIIIIIMMIIIIIIIMIIMIMIIIMII 

Db 278 RGLDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 



Qy 68 KVTALYNQHNPGASAAPCCVPQALEPLPIVTYVGRKPKVEQLSNMIVRSCKCS 120 

IIIIIIIIIIIIIIIIIIIIIIIIIIIMMIIIII II 1 , 1 "; 

Db 338 KVLALYNQHN PGAS AAPCCVPQALE PLP I VY YVGRKAKVEQLSNM I VRS CKCS 390 



RESULT 11 
TGF1 CHICK 



ID TGF1JZHICK STANDARD; PRT; 373 AA. 

AC P09531; 

DT 01-MAR-1989 (Rel. 10, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 4) 

DE (Fragment) . 

GN TGFB1 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae ; Gall if ormes ; Phasianidae; Phasianinae,- 

OC Gallus. 

OX NCBI__TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=89112198; PubMed=24 64 13 1 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a messenger 

RT ribonucleic acid encoding transforming growth factor beta 4 from 

RT chicken embryo chondrocytes."; 

RL Mol. Endocrinol. 2:1186-1195(1988). 

RN [2] 

RP REVISIONS . 

RX MEDLINE=92357039; PubMed= 13 53 8 60 ; 

RA Burt D.W., Jakowlew S.B.; 

RT "Correction: a new interpretation of a chicken transforming growth 

RT factor-beta 4 complementary DNA . 11 ; 

RL Mol. Endocrinol. 6:989-992(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M31160; AAB05637.1; -. 

DR PIR; A41918; A41918. 



DR HSSP; P01137; 1KLA. 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; PR01423; 


TGFB ETA . 




DR 


ProDom; PD0 00357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF_BETA_1 ; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


NON TER 


1 


1 




FT 


SIGNAL 


<1 


1 


POTENTIAL. 


FT 


PROPEP 


2 


259 


POTENTIAL. 


FT 


CHAIN 


260 


373 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


266 


277 


BY SIMILARITY. 


FT 


DISULFID 


276 


339 


BY SIMILARITY. 


FT 


DISULFID 


305 


370 


BY SIMILARITY. 


FT 


DISULFID 


309 


372 


BY SIMILARITY. 


FT 


DISULFID 


338 


338 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


54 


54 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


109 


109 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


153 


153 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


224 


226 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


373 AA 


; 42634 MW; 


9903F3479C8552E5 CRC64 ; 



Query Match 79.1%; Score 542; DB 1; Length 373; 

Best Local Similarity 82.3%; Pred. No. 5.2e-51; 

Matches 93; Conservative 9; Mismatches 9; Indels 2; Gaps 1; 
Qy 10 LDTNYCF - - SSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 67 

llhlll = lllllll MINIMI MINIUM NlhllNMN lllh 

Db 261 LDTDYCFGPGTDEKNCCVRPLYI DFRKDLQWKW1 HEPKGYMANFCMGPCPYIVJSADTQYT 32 0 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

INIMIIMIIIIINMIN hlllhlllll MlllllhlhNN 

Db 321 KVXiALYNQHNPGASAAPCCVPQTLDPLPI I YYVGRNVRVEQLSNMWRACKCS 373 



RESULT 12 
TGF1_XENLA 

ID TGF1_XENLA STANDARD; PRT; 382 AA. 

AC P16176; 

DT 01-APR-1990 (Rel. 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 5) . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae ; Xenopus . 

OX NCBI_TaxID=83 55; 
RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=901 10090; PubMed=22 9 5601 ; 

RA Kondaiah P., Sands M.J., Smith J.M., Fields A., Roberts A.B., 

RA Sporn M.B., Melton D.A. ; 



RT "Identification of a novel transforming growth factor-beta (TGF-beta 

RT 5) mRNA in Xenopus laevis." ; 

RL J. Biol. Chem. 265:1089-1093(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Vempati U.D., Kondaiah P.; 

RL Submitted (AUG-1997) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: IMPORTANT ROLE IN CERTAIN ASPECTS OF DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulfide- linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; J05180; AAA49968.1; -. 

DR EMBL; AF009335; AAB64441.1; -. 

DR EMBL; AF009331; AAB64441.1; JOINED. 

DR EMBL; AF009332; AAB64441.1; JOINED. 

DR EMBL; AF009333; AAB64441.1; JOINED. 

DR EMBL; AF009334; AAB64441.1; JOINED. 

DR PIR; A34929; B61036. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; IPR003911; TGF_TGFb . 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 



DR 


PRINTS; : 


PR00438 ; 


GFCYSKNOT . 




DR 


PRINTS; ! 


PR01423; 


TGFB ETA . 




DR 


ProDom; : 


PD000357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


21 


POTENTIAL. 


FT 


PROPEP 


22 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


277 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


73 


73 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


123 


123 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


166 


166 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


234 


236 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


382 AA 


; 44200 MW; 


1034621C917AAE15 CRC64 ; 



Query Match 73.7%; Score 505; DB 1; Length 382; 

Best Local Similarity 74.3%; Pred. No. 5.1e-47; 

Matches 84; Conservative 12; Mismatches 17; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: : III : lllh I I h I I I I I I I I I I I I I I I I Ihlll IIIHMIUI 
Db 270 RGVGQEYCFGNNGPNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYS 32 9 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 12 0 

I I I : I > I : I I I I :||||| lllllhllll. Mlllllhllll II 
Db 330 KVLSLYNQNNPGAS I S PCCVPDVLEPLP 1 1 YYVGRTAKVEQLSNM WRSCNCS 382 



RESULT 13 
TGF3__MOUSE 

ID TGF3_MOUSE STANDARD; PRT; 410 AA. 

AC P17125; 

DT 01-AUG-1990 (Rel . 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Euthena; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90190650; PubMed=262873 0 ; 

RA Miller D.A. , Lee A., Matsui Y. , Chen E.Y., Moses H.L., Derynck R. ; 

RT "Complementary DNA cloning of the murine transforming growth 

RT factor-beta 3 (TGF beta 3) precursor and the comparative expression 

RT of TGF beta 3 and TGF beta 1 messenger RNA in murine embryos and 

RT adult tissues . 11 ; 

RL Mol . Endocrinol. 3:1926-1934(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91000714 ; PubMed=22 06556 ; 

RA Denhez F., Lafyatis R., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "Cloning by polymerase chain reaction of a new mouse TGF-beta, 

RT mTGF-beta 3 . " ; 

RL Growth Factors 3:139-146(1990). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESI S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide-linked. 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M32745; AAA40422.1; -. 

DR PIR; A41397; A41397. 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI: 9872 7; Tgf b3 . 

DR InterPro; IPR002400; GF_cysknot . 



UK 


InterPro; 


IPR003911; TGF_TGFb. 


nn 
UK 


InterPro; 


IPR001839; TGFb . 




UK 


InterPro ; 


IPR001111; TGFb_N. 




DR 


Pfam; PF00019; TGF-beta; 1. 




DR 


Pfam; PF00688; TGFb_propept ide ; 1. 


UK 


PRINTS; PR00438; 


GFCYSKNOT. 




UK 


PRINTS; PR01423; 


TGFB ETA . 




UK 


ProDom; PD000357; 


TGFb; 1. 




DR 


SMART; SM0 02 04; TGFB; 1. 




T-\T-> 
UK 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


r 1 


SIGNAL 


1 


23 


POTENTIAL. 


r I 


PROPEP 


24 


298 


POTENTIAL. 


FT 


CHAIN 


299 


410 


TRANSFORMING GROWTH FACTOR BETA 3. 


FT 


DISULFID 


305 


314 


BY SIMILARITY. 


FT 


DISULFID 


313 


376 


BY SIMILARITY. 


FT 


DISULFID 


342 


407 


BY SIMILARITY. 


FT 


DISULFID 


346 


409 


BY SIMILARITY. 


FT 


DISULFID 


375 


375 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


72 


72 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


133 


133 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


140 


140 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


SITE 


259 


261 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


410 AA 


>; 46884 MW; 


250F7048CA432BD6 CRC64 ; 


Query Match 




73.3%; 


Score 5 02; DB 1; Length 410; 



Best Local Similarity 76.1%; Pred. No. 1.2e-46; 

Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

III' : hlllll lllllhlllllhlllllhllll III I II 
Db 298 RALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADTTHS 3 57 

Qy 68 KVLAL YNQHN PGASAA PCCVPQALE PLP I VY Y VGRKPKVEQLSNM I VRSCKCS 120 

II III II llhllllll I I I I hlllll llllllllhhillll 

Db 358 TVLGLYNTLN P EASAS PCCVPQDLE PLT I LY Y VGRTPKVEQLSNM WKSCKCS 410 



RESULT 14 
TGF3_HUMAN 

ID TGF3_HUMAN STANDARD; PRT; 412 AA. 

AC P10600; 

DT 01-JUL-1989 (Rel . 11, Created) 

DT 01-JUL-1989 (Rel. 11, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
OX NCBI __Tax I D= 9 6 0 6 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=88263 019; PubMed=3 1644 76 ; 

RA ten Dijke P., Hansen P., Iwata K. , Pieler C. , Foulkes J.G.; 

RT "Identification of another member of the transforming growth factor 

RT type beta gene family."; 



RL Proc. Natl. Acad. Sci. U.S.A. 85:4715-4719(1988). 

RN [2] 

RP SEQUENCE FROM N . A . 

RC TISSUE=Placenta ; 

RX MEDLINE=89091120; PubMed=3 2 08 74 6 ; 

RA Derynck R., Lindquist P.B., Lee A . , Wen D. , Tamm J., Graycar J.L., 

RA Rhee L. , Mason A.J., Miller D.A. , Coffey R.J., Moses H.L., Chen E.Y.; 

RT "A new type of transforming growth factor-beta, TGF-beta 3."; 

RL EMBO J. 7:3737-3743(1988). 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Madan A. , Rowen L. , Qin S. , Dickhoff R . , Shaffer T. , James R. , 

RA Abbasi N . , Loretz C. , Madan A., Dors M., Dahl T. , Hall J., Lasky S., 

RA Hood L . ; 

RT "Complete genomic sequence of human transforming growth factor-beta 

RT 3 . " ; 

RL Submitted (NOV- 1998 ) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Rieder M.J. , Livingston R.J., Braun A.C., Montoya M.A. , Chung M.-W., 

RA Miyamoto K.E., Nguyen CP., Nguyen D.A. , Poel C.L., Robertson P.D., 

RA Schackwitz W.S., Sherwood J.K., Witrak L.A., Nickerson D.A. ; 

RL Submitted (AUG-2002) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP X-RAY CRYSTALLOGRAPHY (2.0 ANGSTROMS) OF 301-412. 

RX MEDLINE=96416253; PubMed=88 19159 ; 

RA Mittl P.R., Priestle J. P., Cox D.A., McMaster G. , Cerletti N., 

RA Grutter M.G. ; 

RT "The crystal structure of TGF-beta 3 and comparison to TGF-beta 2: 

RT implications for receptor binding."; 

RL Protein Sci. 5:1261-1271(1996). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESI S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformat ics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

cc 

DR EMBL; J03241; AAA61161.1; -. 

DR EMBL; X14149; CAA32362.1; -. 

DR EMBL; X14885; CAA33024.1; ALT_INIT. 

DR EMBL; X14886; CAA33024.1; JOINED. 

DR EMBL; X14887; CAA33024.1; JOINED. 

DR EMBL; X14888; CAA33024.1; JOINED. 

DR EMBL; X14889; CAA33024.1; JOINED. 

DR EMBL; X14890; CAA33024.1; JOINED. 

DR EMBL; X14891; CAA33 024.1; JOINED. 

DR EMBL; AF107885; AAC79727.1; -. 

DR EMBL; AY140241; AAM96819.1; -. 

DR PIR; A36169; A36169. 

DR PDB; 1TGJ; ll-JAN-97. 



DR 


PDB; 1TGK; 


12-MAR- 


97. 




DR 


PDB; 1KTZ; 


27-FEB- 


02 . 




DR 


Genew; HGNC : 11769 ; 


TGFB3 . 




DR 


MIM; 190230; -. 






DR 


GO; GO:0005160; F: 


transforming growth factor-beta receptor li. 


DR 


GO; GO:0007267; P: 


cell -cell 


signaling; TAS . 


DR 


GO; GO:0007397; P: 


histogenesis and organogenesis; TAS . 


DR 


GO; GO:0007165; P: 


signal transduction; TAS. 


DR 


InterPro; 


IPR0024 00; GF cysknot . 


DR 


InterPro; 


IPR003911; TGF TGFb . 


DR 


InterPro ; 


IPR001839; TGFb . 




DR 


InterPro; 


IPR001111; TGFb N 




DR 


Pfam; PF00019; TGF 


'-beta; 1. 




DR 


Pfam; PF00688; TGFb_propeptide; 1. 


DR 


PRINTS; PR00438; GFCYSKNOT. 




DR 


PRINTS; PRO 142 3; TGFBETA . 




DR 


ProDom; PD0003 57; 


TGFb; 1. 




DR 


SMART ; SM002 04; TGFB; 1. 




DR 


PROSITE; PS00250; 


TGF BETA 


1; 1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal; 3D-structure . 


FT 


SIGNAL 


1 


20 


POTENTIAL 


FT 


PROPEP 


21 


300 




FT 


CHAIN 


301 


412 


TRANSFORMING GROWTH FACTOR BETA 3. 


FT 


DISULFID 


307 


316 




FT 


DISULFID 


315 


378 




FT 


DISULFID 


344 


409 




FT 


DISULFID 


348 


411 




FT 


DISULFID 


377 


377 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


74 


74 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


135 


135 


N- LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


142 


142 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


261 


263 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


STRAND 


303 


303 




FT 


HELIX 


304 


309 




FT 


STRAND 


316 


318 




FT 


STRAND 


321 


323 




FT 


HELIX 


324 


328 




FT 


TURN 


331 


332 




FT 


STRAND 


333 


335 




FT 


STRAND 


338 


340 




FT 


STRAND 


343 


345 




FT 


TURN 


350 


353 




FT 


STRAND 


354 


354 




FT 


HELIX 


357 


368 




FT 


TURN 


370 


371 




FT 


STRAND 


378 


380 




FT 


STRAND 


383 


392 




FT 


TURN 


393 


394 




FT 


STRAND 


395 


406 




FT 


STRAND 


408 


412 




SQ 


SEQUENCE 


412 AA; 


47328 MW; 3CAD3 54 8D3AEA178 CRC64; 


Query Match 




73 .3%; 


Score 502; DB 1; Length 412; 


Best Local Similarity 


76 . 1%; 


Pred. No. 1.2e-46; 



Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 



MINI. II : MIIM 1 1 1 1 1 hi 1 1 1 1! : 1 1 1 1; hi 1 1 , Mill: I II 

Db 3 00 RALDTNYCFRNLEENCCVRPLYIDFRQDL.GWKWVHEPKGYYANFCSGPCPYLRSADTTHS 359 

Qy 68 KVLALYNQHN PGASAAP CCVPQALE PLP I VY YVGR KP KVEQLS NM I VRS CKCS 120 

II III II llhllllll 1 1 1 1 hill llllilllhhIIIM 

Db 360 TVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 



RESULT 15 
TGF3_RAT 

ID TGF3_RAT STANDARD; PRT; 412 AA. 

AC Q07258; 

DT 01-OCT-1994 (Rel . 30, Created) 

DT 01-FEB-1996 (Rel. 33, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 OR TGF-B3 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=l 0116; 

RN [ 1 ] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar ; TISSUE=Lung ; 

RX MEDLINE=95155340; PubMed=7852342 ; 

RA Wang J . , Kuliszewski M . , Yee W. , Sedlackova L. # Xu J . , Tseu I., 

RA Post M. ; 

RT "Cloning and expression of glucocorticoid- induced genes in fetal rat 

RT lung fibroblasts. Transforming growth factor-beta 3."; 

RL J. Biol. Chem. 270:2722-2728(1995). 

RN [2] 

RP SEQUENCE OF 159-213 FROM N.A. 

RX MEDLINE=93286190; PubMed=8509457 ; 

RA McKinnon R.D., Piras G . , Ida J., Dubois-Dalq M. ; 

RT "A role for TGF-beta in oligodendrocyte differentiation."; 

RL J. Cell Biol. 121:13 97-1407(1993). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENES I S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide -linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWTSS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib. ch) . 

CC 

DR EMBL; U03491; AAA67915.1; -. 

DR EMBL; X71903; CAA50722.1; -. 

DR PIR; A55706; A55706. 

DR HSSP; P10600; 1TGJ. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF__TGFb . 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb N. 



DR 


Pfam; PF00019; TGF-beta; 1. 




nR 


Pfam; PF00688; TGFb_propeptide; 1. 


DR 


PRINTS; 


PR00438; 


GFCYSKNOT. 




UK 


PRINTS; 


PR01423; 


TGFB ETA . 




LJt\ 


ProDom; 


PD000357; 


TGFb; 1. 




OR 


SMART; SM00204; TGFB; 1. 




nR 


PROSITE; 


PS00250; 


TGF_BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


300 


POTENTIAL. 


FT 


CHAIN 


301 


412 


TRANSFORMING GROWTH FACTOR BETA 3. 


FT 


DISULFID 


307 


316 


BY SIMILARITY. 


FT 


DISULFID 


315 


378 


BY SIMILARITY. 


FT 


DISULFID 


344 


409 


BY SIMILARITY. 


FT 


DISULFID 


348 


411 


BY SIMILARITY. 


FT 


DISULFID 


377 


377 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


74 


74 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


135 


135 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


142 


142 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


261 


263 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


412 AA 


,; 47116 MW; 


24FD7D899090AA9D CRC64 ; 


Query Match 




73 . 1%; 


Score 501; DB 1; Length 412; 



Best Local Similarity 76.1%; Pred . No. 1.5e-46; 

Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

: hlllll lllllhlllllhlllllhllll MM | || 
Db 300 RALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSSDTTHS 35 9 

Qy 68 KVTLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

II III II llhllllll I I I I hlllll llllllllhhIMII 
Db 360 TVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 



Search completed: October 28, 2003, 09:08:38 
Job time : 2.4052 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



October 28, 2003, 07:50:55 ; Search time 9.6208 Seconds 

(without alignments) 
3218.683 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-372E-11 
685 

1 DYKDDDDKALDTNYCFSSTE GRKPKVEQLSNM I VRSCKCS 120 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



830525 seqs, 258052604 residues 



Total number of hits satisfying chosen parameters: 830525 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SPTREMBL_23 : * 

1 : sp_archea : * 

2: sp_bacteria : * 

3 : sp_f ungi : * 

4 : sp_human : * 

5 : sp_invertebrate : * 

6: sp_mammal:* 

7 : sp_mhc : * 

8: sp_organelle : * 

9 : sp_phage : * 

10: sp_plant:* 

11: sp_rodent:* 

12: sp_ViruS:* 

13: sp_vertebrate : * 

14 : sp__unclassi f ied: * 

15: sp_rvirus:* 

16 : sp_bacteriap : * 

17: sp_archeap:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


638 


93 . 1 


112 


6 


002730 


002730 oryctolagus 


2 


637 


93 . 0 


368 


11 


Q8R4D9 


Q8r4d9 sigmodon hi 


3 


628 


91.7 


390 


6 


Q9TUM8 


Q9tum8 equus cabal 


4 


625 


91 .2 


130 


11 


Q08714 


Q08714 mesocricetu 


5 


622 


90.8 


124 


6 


Q95N80 


Q95n80 canis famil 


6 


554 


80 . 9 


101 


11 


Q9R184 


Q9rl84 meriones un 


7 


502 


73 .3 


362 


11 


Q99K17 


Q99kl7 mus musculu 


8 


502 


73 .3 


412 


11 


Q91YU7 


Q91yu7 mus musculu 


9 


482 


70.4 


255 


11 


Q921T1 


Q921tl mus musculu 


10 


482 


70 .4 


414 


11 


Q91VP5 


Q91vp5 mus musculu 


11 


471.5 


68 .8 


200 


13 


Q90YF1 


Q90yfl pleuronecte 


12 


470 


68.6 


224 


11 


Q8CDZ9 


Q8cdz9 mus musculu 


13 


470 


68 . 6 


382 


13 


093449 


09344 9 oncorhynchu 


14 


465. 5 


68 . 0 


382 


13 


Q9PWA9 


Q9pwa9 morone chry 


15 


464 . 5 


67.8 


379 


13 


Q8JHF5 


Q8jhf5 sparus aura 


16 


460.5 


67.2 


379 


13 


Q8AXK8 


Q8axk8 sparus aura 


17 


440 


64 .2 


361 


13 


Q98854 


Q98854 cyprinus ca 


18 


436 


63 . 6 


399 


11 


Q9ERB7 


Q9erb7 mesocricetu 


19 


429 


62 .6 


376 


13 


Q9PTQ2 


Q9ptq2 cyprinus ca 


20 


413 


60.3 


88 


13 


Q90YF5 


Q90yf5 pleuronecte 
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202.5 


29. 


.6 


389 


13 


Q90YY0 


Q90yy0 ictalurus p 


39 


200 


29. 


.2 


426 


4 


Q9HBP0 


Q9hbp0 homo sapien 


40 


199.5 


29 


. 1 


373 


13 


Q90ZD2 


Q90zd2 oncorhynchu 


41 


199.5 


29. 


. 1 


373 


13 


Q90ZD1 


Q90zdl oncorhynchu 


42 


199.5 


29. 


. 1 


373 


13 


Q9DDI8 


Q9ddi8 salmo salar 


43 


199.5 


29. 


. 1 


376 


13 


Q98TB4 


Q98tb4 oreochromis 


44 


199 . 5 


29. 


. 1 


376 


13 


Q9 0WC9 


Q90wc9 morone saxa 


45 


199.5 


29. 


. 1 


376 


13 


Q90WC8 


Q90wc8 morone amer 



ALIGNMENTS 

RESULT 1 
002730 



ID 002730 PRELIMINARY; PRT; 112 AA. 

AC 002730; 097501; 

DT 01-JUL-1997 (TrEMBLrel . 04, Created) 

DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 OR TGF-BETA- 1. 

OS Oryctolagus cuniculus (Rabbit) . 

0C Eukaryota; Metazoa; Chordata; Craniata; Vertebra ta; Euteleostomi ; 

0C Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Taylor T.K., James E.R., McGomgle S., Yoho E.R.; 

RL Submitted (APR-1997) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 2-99 FROM N.A. 

RA Inoue K. , Kawabe Y., Kodama T. ; 

RL Submitted (NOV-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 



CC -1- SUBUNIT: HOMODIMER, DI SULFIDE -LINKED (BY SIMILARITY) . 

CC SIMILARITY: BELONGS TO THE TGF-BETA FAMILY . 

DR EMBL; AF000133 ; AAB53806.1; 

DR EMBL; AB020217; BAA36950 . 1 ; -. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT NON_TER 


1 


1 




FT CHAIN 


1 


112 


TRANSFORMING GROWTH FACTOR BETA 1 


FT DISULFID 


7 


16 


BY SIMILARITY. 


FT DISULFID 


15 


78 


BY SIMILARITY. 


FT DISULFID 


44 


109 


BY SIMILARITY. 


FT DISULFID 


48 


111 


BY SIMILARITY. 


FT DISULFID 


77 


77 


INTERCHAIN (BY SIMILARITY) . 


FT CONFLICT 


2 


3 


LD -> FS (IN REF. 2) . 


FT CONFLICT 


85 


92 


PLPIVYYV -> ATAHRVTTL (IN REF. 2) 


SQ SEQUENCE 


112 AA; 


12795 MW; 


53C5B7D46355A6F3 CRC64; 


Query Match 




93.1%; 


Score 638; DB 6; Length 112; 


Best Local Similarity 


100 . 0%; 


Pred. No. 2.8e-66; 



Matches 112; Conservative 



0; Mismatches 



0; Indels 



0 ; Gaps 



Qy 

Db 



9 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 68 

iii i ii 1 1 1 1 1 Mi ii 1 1 1 1 1 ii 1 1 !i ih i ii 1 1 1 1 1 ;i 1 1 1 m 1 1 1 1 1 ii i ii 1 1 1 

1 ALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSK 60 



Qy 
Db 



69 VLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

I iiiii 1 1 1 1 1 ii i ii 'i 1 1 1 ;■ ii 1 1 ii i i;i ii m i ii :i 1 1 1 u 1 1 1 m 

61 VLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 112 



RESULT 2 
Q8R4D9 

ID Q8R4D9 PRELIMINARY; PRT; 368 AA. 

AC Q8R4D9; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta-1 protein (Fragment) . 

GN TGFB1 . 

OS Sigmodon hispidus (Hispid cotton rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Sigmodont inae; 

OC Sigmodon. 

OX NCBI_TaxID=42415; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Blanco J.C., Pletneva L.M., Prince G . A . ; 

RT "Cotton rat cytokines, chemokines, and interferons."; 

RL Submitted (FEB-2002) to the EMBL / GenBa nk / DDB J databases. 
CC SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 



DR EMBL; AF480858; AAL87199.1; 

DR InterPrO; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro ; IPR001111; TGFbJJ. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM002 04; TGFB ; 1. 

DR PROSITE; PS00250; TGF__BETA_1 ; 1. 

FT NONJTER 1 1 

SQ SEQUENCE 368 AA; 41905 MW; A5C9 12 0 7B04 68B4A CRC64 ; 

Query Match 93.0%; Score 637; DB 11; Length 368 ; 

Best Local Similarity 98.2%; Pred. No. 1.4e-65; 

Matches 111; Conservative 2 ; Mismatches 0; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

UIIMIIIIIIIIIIIMIIIIIIIIIIMIIIIMMIIIIIIIIIIIIIIIIIIIII 

Db 256 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 315 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

MMIIIIIIIIIIhlMIMMIMIMIIIIIIIMMIIMIIMMM 

Db 316 KVLALYNQHNPGASAS PCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 368 



RESULT 3 
Q9TUM8 

ID Q9TUM8 PRELIMINARY; PRT; 390 AA. 

AC Q9TUM8 ; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

GN TGFB1. 

OS Equus caballus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=9796; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Nixon A. J., Brower-Toland B.T., Sandell L.J.; 

RT "Molecular cloning of equine transforming growth factor beta 1 reveals 

RT a divergent nucleotide structure that encodes a novel bioactive 

RT peptide among mammalian species."; 

RL Submitted (AUG-1999) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF175709; AAD49431.1; -. 

DR HSSP; P01137; UCLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PRO 03 911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 043 8; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 



DR ProDom; PD000357; TGFb; 1. 

DR SMART ; SM00204; TGFB; 1. 

DR PROSITE; PS 002 50; TGF_BETA_1 ; 1. 

SQ SEQUENCE 390 AA; 438G0 MW; 220FE40DFCCA6016 CRC64 ; 

Query Match 91.7%; Score 628; DB 6; Length 390; 

Best Local Similarity 97.3%; Pred. No. 1.7e-64; 

Matches 110; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

:|IMIII II 1 1 II I II 1 1 1 1 1 1 1 II I I IN I I I Mllll 1 1 1 1 1 1 II 1 1 1 II 1 1 1 Ml 

Db 278 RALDTNYCSSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 337 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 

IIIIIIIIIIIIIIIIIMIII I II' II II, 1 1 1 !i I IMI 1 1 1 1 1 II i II I 

Db 33 8 KVLALYNQHNPGASAAPCCVPQVLEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 4 
Q08714 

ID Q08714 PRELIMINARY; PRT; 130 AA . 

AC Q08714; 070331; 

DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01-NOV-1996 (TrEMBLrel. 01, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1. 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCB I _Ta x I D= 1 0 0 3 6 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=LVG (SYR) ; 

RX MEDLINE=93304479; PubMed=83 17544 ; 

RA Wong D.T., Donoff R.B., Yang J., Song B.Z., Matossian K. , Nagura N . , 

RA Elovic A., McBride J., Gallagher G. , Todd R. ; 

RT "Sequential expression of transforming growth factors alpha and beta 1 

RT by eosinophils during cutaneous wound healing in the hamster."; 

RL Am. J. Pathol. 143:130-142(1993). 

RN [2] 

RP SEQUENCE OF 26-115 FROM N.A. 

RC STRAIN=SYRIAN; TISSUE=SPLEEN; 

RX MEDLINE=9823 4 044 ; PubMed=9573 100 ; 

RA Melby P.C., Tryon V.V., Chandrasekar B. , Freeman G.L.; 

RT "Cloning of Syrian hamster (Mesocricetus auratus) cytokine cDNAs and 

RT analysis of cytokine mRNA expression in experimental visceral 

RT leishmaniasis . " ; 

RL Infect. Immun. 66:2135-2142(1998). 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 
CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER, D I SULFIDE -LINKED . 



CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; X60296; CAA42838.1; 

DR EMBL; AF046214; AAC40099.1; 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PDO 0 0357; TGFb; 1. 

DR SMART ; SM00204; TGFB ; 1. 

DR PROSITE; PS0 025 0; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 

FT NON_TER 1 1 

FT PROPEP <1 18 

FT CHAIN 19 130 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 25 34 BY SIMILARITY. 

FT DISULFID 33 96 BY SIMILARITY. 

FT DISULFID 66 129 BY SIMILARITY. 

FT DISULFID 95 95 INTERCHAIN (BY SIMILARITY) . 

FT CONFLICT 9 3 93 G -> S (IN REF . 2) . 

SQ SEQUENCE 13 0 AA; 14 997 MW; 8B4 1DD6CF3 9CCA77 CRC64 ; 

Query Match 91.2%; Score 625; DB 11; Length 130; 

Best Local Similarity 97.3%; Pred. No. l.le-64; 

Matches 110; Conservative 1; Mismatches 2; Indels 0; Gaps 0; 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

:|MMIMIIIMIIIIMIMMIIIIMIIIIMIIIIIIIMIIIIIIIIIIIIII 

Db 18 RALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYS 77 

Qy 68 KVLALYNQHN PGASAAPCCVPQALE PLP I VY Y VGRKPK VEQLSNMI VRS CKCS 120 

MIIMIMMIMI IMIIIMMMIIIIMIMIIIMIIMMI III 

Db 78 KVLALYNQHNPGASAGPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSYKCS 130 

RESULT 5 
Q95N80 

ID Q95N80 PRELIMINARY; PRT; 124 AA . 

AC Q95N8 0; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1 (Fragment) . 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Fonfara S., Groene A., Baumgaertner W. ; 

RT "Sequence of canine transforming growth factor beta 1 mRNA in DH82- 

RT cells."; 

RL Submitted (FEB-2 001) to the EMBL/ GenBank / DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF349538; AAK54072.1; -. 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PDO 003 57; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 



DR PROSITE; PS 002 50; TGF_BETA__1 ; 1. 

FT NON_TER 1 1 

FT NONJTER 124 124 

SQ SEQUENCE 124 AA; 14329 MW ; 21D1852 18E5556DB CRC64 ; 



Query Match 90.8%; 
Best Local Similarity 99.1%; 
Matches 109; Conservative 



Score 622; DB 6; Length 124; 
Pred. No. 2.3e-64; 
1; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 
Db 



8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKVIIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

UMMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIII 

15 RALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 74 



Qy 
Db 



68 KVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSC 117 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
7 5 KVLALYNQHN PGASAAPCCVPQALEPLP I VY YVGRKPKVEQLSNMI VRS C 124 



RESULT 6 
Q9R184 



ID Q9R184 PRELIMINARY; PRT; 101 AA . 

AC Q9R184; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 OR TGF-BETA. 

OS Meriones unguiculatus (Mongolian jird). 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Gerbillinae; 

OC Meriones . 

OX NCBI_TaxID=10047; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=LYMPH NODE; 

RA Rao U.R. , Klei T.R. ; 

RT "cDNA cloning of gerbil transforming growth factor-beta by PCR.",- 

RL Submitted (JUN-1999) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER, D I SULF I DE - LI NKED (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF161218; AAD45726.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD0 00357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 

FT NON_TER 1 1 

FT CHAIN <1 >101 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 1 10 BY SIMILARITY. 



FT DISULFID 9 72 BY SIMILARITY . 

FT DISULFID 71 71 INTERCHAIN (BY SIMILARITY) . 

FT NONJTER 101 101 

SQ SEQUENCE 101 AA; 11724 MW; ABF1CFDA2 64AEFED CRC64; 

Query Match 80.9%; Score 554; DB 11; Length 101; 

Best Local Similarity 96.0%; Pred. No. 1.4e-56; 

Matches 97; Conservative 1; Mismatches 3; Indels 0; Gaps 0; 

Qy 15 CFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYN 74 

lllllllllllllll llllllllllllllllllllllllllllllll IIIMIIIIII 
Db 1 CFSSTEKNCCVRQLYRDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLHTQYSKVLALYN 60 

Qy 7 5 QHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVR 115 

lllllllhlllllllllllllllllllllllllllll II 
Db 61 QHNPGASASPCCVPQALEPLP I VYYVGRKPKVEQLSNMFVR 101 



RESULT 7 
Q99K17 

ID Q99K17 PRELIMINARY; PRT; 362 AA . 

AC Q99K17; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 3 (Fragment) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI JTaxI D= 100 9 0; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Strausberg R.; 

RL Submitted (MAR-2001) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC005513; AAH05513.1; -. 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI: 98 727; Tgf b3 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PRO 03911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA. 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART ; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT NONJTER 1 1 

SQ SEQUENCE 362 AA; 41486 MW; 0808E46180FDAE70 CRC64 ; 



Query Match 73.3%; Score 502; DB 11; Length 362; 

Best Local Similarity 76.1%; Pred. No. 6.6e-50; 

Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

III II! : hlllll IIMIhlllllhlMllhlMI MMh I II 
Db 2 50 RALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADTTHS 3 09 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMIVRSCKCS 120 

(I Ml 1 1 MhMIMI MM hlllll llllllllhhillll 

Db 310 TVLGLYNTLNPEASASPCCVPQDLEPLTI LYYVGRTPKVEQLSNMWKSCKCS 362 

RESULT 8 
Q91YU7 

ID Q91YU7 PRELIMINARY; PRT; 412 AA . 

AC Q91YU7; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor, beta 3. 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chorda ta; Crania ta; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY . 

DR EMBL; BC014690; AAH14690.1; 

DR MGD; MGI : 98727; Tgf b3 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGFJTGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 412 AA; 47144 MW; F3EB65D046DF32AD CRC64 ; 

Query Match 73.3%; Score 502; DB 11; Length 412; 

Best Local Similarity 76.1%; Pred. No. 7.7e-50; 

Matches 86; Conservative 12; Mismatches 15; Indels 0; Gaps 0 
Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYPiANFCLGPCPYIWSLDTQYS 67 

MIMIIM : hlllll 1 1 1 II h I M I ! h M ! M M ! M ! MMh ! !! • I 

Db 3 00 RALDTNYCFRNLEENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSADTTHS 359 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMIVRSCKCS 120 

II III II II hi Mill MM hill M III Mill hhl III I 

Db 36 0 TVLGLYNTLNPEASASPCCVPQDLEPLTI LYYVGRTPKVEQLSNMWKSCKCS 412 

RESULT 9 
Q921T1 



ID Q921T1 PRELIMINARY; PRT; 255 AA. 

AC Q921T1; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 2. 

GN TGFB2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCB I _Tax ID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Strausberg R. ; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases. 

CC -1- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY . 

DR EMBL; BC011055; AAH11055.1; -. 

DR MGD; MGI : 98726 ; Tgf b2 . 

DR InterPro; IPR0024 00; GFjzysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PRO 03 911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb^propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423 ; TGFBETA. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 255 AA; 29087 MW ; 9454 0017F3C5C2 1 9 CRC64 ; 

Query Match 70.4%; Score 482; DB 11; Length 255; 

Best Local Similarity 70.8%; Pred. No. 9.3e-48; 

Matches 80; Conservative 16; Mismatches 17; Indels 0; Gaps 0; 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

III : : MM I I I I h : I I I I I I I I I I I I I : I I I I I llhll Ml- 
Db 143 RALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGACPYLWSSDTQHT 2 02 

Qy 68 KVLALYNQHNPGASAAPCCVPQALE PLP I VY Y VGRKP KVEQLSNM I VRS CKCS 120 

II hill II 1 1 1 : 1 1 1 1 I 1 1 1 1 h 1 1 • I Ihllll MINIMI 

Db 2 03 KVLSLYNT I N PEASAS PCCVSQDLE PLT I LY Y I GNTPKI EQLSNMI VKS CKCS 2 55 



RESULT 10 
Q91VP5 

ID Q91VP5 PRELIMINARY; PRT; 414 AA. 

AC Q91VP5; 

DT Ol-DEC-2001 (TrEMBLrel. 19, Created) 

DT Ol-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 2. 

GN TGFB2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 



RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Breast tumor; 

RA Strausberg R . ; 

RL Submitted (JUL-2001) to the EMBL/ GenBank/DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC011170; AAH11170.1; -. 

DR MGD; MGI: 98726; Tgf b2 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb__N . 

DR InterPro; I PRO 03 911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 043 8; GFCYSKNOT. 

DR PRINTS; PRO 1423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 414 AA; 47588 MW; DB37A7C38881F286 CRC64 ; 



Query Match 70.4%; Score 482; DB 11; Length 414; 

Best Local Similarity 70.8%; Pred. No. 1.6e-47; 

Matches 80; Conservative 16; Mismatches 17; Indels 0; Gaps 0, 

Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYS 67 

■II Ml : : UN MM::, I II II I 'llllhlli I | Ihl Ml- 
Db 302 RALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGACPYLWSSDTQHT 361 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

Ilhlll II llhllll I I I I I hlhl Ihlllllllhlllll 
Db 362 KVLSLYNTINPEASASPCCVSQDLEPLTILYYIGNTPKIEQLSNMIVKSCKCS 414 



RESULT 11 
Q90YF1 

ID Q90YF1 PRELIMINARY; PRT; 200 AA . 

AC Q90YF1; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1 (Fragment) . 

GN TGF - BETA 1. 

OS Pleuronectes platessa (Plaice) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha; Pi euronecti formes ; 

OC Pleuronectoidei ; Pleuronectidae ; Pleuronectes. 

OX NCBI_TaxID=8262; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Laing K.J. ; 

RL Submitted (AUG-2001) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20394651; PubMed=10938738 ; 

RA Laing K.J., Cunningham C. , Secombes C.J.; 



RT "Genes for three different isoforms of transforming growth factor-beta 

RT are present in plaice ( Pleuronectes platessa) DNA . " ; 

RL Fish and Shellfish Immunol. 10:261-271(2000). 

CC -!- SIMILARITY : BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; A J3 18 933 ; CAC60268.1; -. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR PRINTS; PRO 043 8; GFCYSKNOT . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA__1 ; 1. 

FT N0N_TER 1 1 

FT NON_TER 200 200 

SQ SEQUENCE 200 AA; 22851 MW; 4 876FEB6A263B4CC CRC64 ; 

Query Match 68.8%; Score 471.5; DB 13; Length 200; 

Best Local Similarity 73.4%; Pred . No. 1.2e-46; 

Matches 80; Conservative 14; Mismatches 14; Indels 1; Gaps 1; 
Qy 12 TNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYIiANFCLGPCPYIWSLDTQYSKVLA 71 

h I = II IMhIIIIMIIIIIIIIhl llllhhl I llh : Hl-ll 

Db 93 TDTCTAQTE-TCCVRKLYIDFRKDLGWKWIHKPTGYHANYCMGSCTYIWNAENKYSQILA 151 

Qy 72 LYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 12 0 

II lllllll IMIIIIIIIIIhllllh II I I I I II MINI 
Db 152 LYKHHNPGASAQPCCVPQALEPLPILYYVGRQHKVEQLSNMSVKSCKCS 200 



RESULT 12 
Q8CDZ9 

ID Q8CDZ9 PRELIMINARY; PRT; 224 AA . 

AC Q8CDZ9; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor (Fragment) . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murmae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC STRAIN=C57BL/6J; TISSUE=Head; 

RX MEDLINE=22354683; PubMed=124 66851 ; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK029306; BAC26384.1; -. 

FT NON_TER 1 1 

SQ SEQUENCE 224 AA; 25538 MW ; 9264D6C878CBD2BC CRC64 ; 



Query Match 68.6%; Score 470; DB 11; Length 224; 

Best Local Similarity 69.9%; Pred. No. 2e-46; 

Matches 79; Conservative 16; Mismatches 18; Indels 0; Gaps 0; 



Qy 8 KALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYS 67 

III Ml : : Ml I II I h ^ II II M I I I I II hl'h | |||:|| Ml- 
Db 112 RALDAAYCFRNVQDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAGACPYLWSSDTQHT 171 

Qy 68 KVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

Ml: IN M I II: III I I MM hi hi I hill I II MM III 

Db 172 KVLSLYNTINPEASASPCCVSQDLEPLTILYYIGNTPKIEQLSNMI VKSRKCS 224 

RESULT 13 
093449 

ID 093449 PRELIMINARY; PRT; 382 AA. 

AC 093449; Q91217; 

DT 01-NOV-1998 (TrEMBLrel . 08, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA OR TGF . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii ; Salmoni formes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8 022; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TI SSUE=LEUKOCYTE ; 

RX MEDLINE=99242020; PubMed= 1022 74 8 1 ; 

RA Daniels G.D., Secombes C. J. ; 

RT "Genomic organisation of rainbow trout, Oncorhynchus mykiss TGF- 

RT BETA."; 

RL Dev. Comp. Immunol. 23:139-147(1999). 

RN [2] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TI SSUE=LEUKOCYTE ; 

RX MEDLINE=98390168; PubMed=9722 928 ; 

RA Hardie L.J., Laing K. J. , Daniels G.D., Grabowski P.S., Cunningham C. , 

RA Secombes C.J. ; 

RT "Isolation of the first piscine transforming growth factor beta gene: 

RT analysis reveals tissue specific expression and a potential regulatory 

RT sequence in rainbow trout (Oncorhynchus mykiss)."; 

RL Cytokine 10:555-563(1998). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOM0DIMER, DISULFIDE -LINKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN BLOOD LEUKOCYTES, KIDNEY 

CC MACROPHAGES, BRAIN, GILL AND SPLEEN BUT NOT IN LIVER. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AJ007836; CAA07707.1; 

DR EMBL; X99303; CAA67685.1; -. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro,- IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT . 




DR 


PRINTS ; 


PR01423; 


TGFBETA . 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB ; 1. 




DR 


PROSITE; 


PS00250; 


TGF_BETA_1; 


1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


PROPEP 


21 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


278 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


76 


76 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


116 


116 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


125 


125 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CONFLICT 


237 


237 


N - > D (IN REF. 2) . 


FT 


CONFLICT 


345 


345 


Q -> H (IN REF. 2) . 


FT 


CONFLICT 


371 


372 


LS -> VP (IN REF . 2) . 


FT 


CONFLICT 


377 


377 


K - > M (IN REF. 2) . 


SQ 


SEQUENCE 


382 AA; 44136 MW; 


93BD4D3540084B92 CRC64 ; 



Query Match 68.6%; Score 470; DB 13; 

Best Local Similarity 75.0%; Pred. No. 3.6e-46; 
Matches 78; Conservative 13; Mismatches 13; 



Length 3 82; 
Indels 0; 



Gaps 



0; 



QY 
Db 

QY 
Db 



17 SSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQH 76 

I ::||lhllllMIIIIIIIIIII II Ihhlll llh : Uhlllll I 
279 SDKSESCCVRKLYIDFRKDLGWKWIHEPTGYFANYCIGPCTYIWNTENKYSQVLALYKHH 338 

77 NPGASAAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 12 0 

IIIMI Ml' lllllhlillh IIIIIMIIhlhll 
33 9 NPGASAQPCCVPQVLEPLPI I YYVGRQHKVEQLSNMI VKSCRCS 382 



RESULT 14 
Q9PWA9 

ID Q9PWA9 PRELIMINARY; PRT; 382 AA . 

AC Q9PWA9; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA . 

OS Morone chrysops x Morone saxatilis (white bass x striped bass) . 

OC EuXaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Perciformes; Percoidei; 

OC Moronidae; Morone. 

OX NCBI_TaxID=45352; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TI SSUE=KI DNEY ; 

RX MEDLINE=2 03 94 636; PubMed=l 0 93 8 723 ; 

RA Harms C.A. , Kennedy-Stoskopf S., Horne W.A. , Fuller F.J., 

RA Tompkins W.A.F.; 



RT "Cloning and sequencing hybrid striped bass (Morone saxatilis x M . 

RT chrysops) transforming growth factor-beta (TGF-beta) , and development 

RT of a reverse transcription quantitative competitive polymerase chain 

RT reaction (RT-qcPCR) assay to measure TGF-beta mRNA of teleost fish."; 

RL Fish Shellfish Immunol. 10:61-85(2000). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 
CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS . 

CC -!- SUBUNIT: HOMODIMER, D I SULFI DE -LI NKED (BY SIMILARITY). 

CC -!- TISSUE SPECIFICITY: HIGHER LEVELS FOUND IN MONONUCLEAR CELLS FROM 
CC PERIPHERAL BLOOD THAN IN SPLEEN OR ANTERIOR KIDNEY. 



CC 


-!- SIMILARITY: 


BELONGS TO THE TGF-BETA FAMILY. 


DR 


EMBL; AF14 0363; 


AAD46997.1; - 




DR 


HSSP; P01137; 1KLA. 




DR 


InterPro; 


IPR0024 00; GF_cysknot. 


DR 


InterPro; 


IPR001839; TGFb. 




DR 


InterPro ; 


IPR001111; TGFb N, 




DR 


InterPro; 


IPR003911; TGF TGFb. 


DR 


Pfam; PF00019; TGF-beta; 1. 




DR 


Pfarn; PF00688; TGFb_propept ide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT. 




DR 


PRINTS; PR01423; 


TGFB ETA . 




UK 


ProDom; PD000357; TGFb; 1. 




DR 


SMART; SM0 0204; 


TGFB; 1. 




DR 


PROSITE; 


PS00250; TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 




POTENTIAL . 


FT 


PROPEP 


? 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


278 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


73 


73 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


108 


108 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


124 


124 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


259 


259 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


235 


237 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


382 AA; 43846 MW; 


3124D8C34EA74D72 CRC64 ; 



Query Match 68.0%; Score 465.5; DB 13; Length 382; 

Best Local Similarity 74.5%; Pred. No. 1.2e-45; 

Matches 79; Conservative 13; Mismatches 13; Indels 1; Gaps 1; 

Qy 15 CFSSTEJCNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYN 74 

I : II I I I I I !i I I i I I I i I I I I ; : I I j : I I : I : I I llh : Ul-llll 
Db 278 CTAQTE -TCCVRSLYI DFRKDLGWKWI HKPTGYNANYCMGSCTYIWNAENKYSQI LALYK 336 

Qy 75 QHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 120 

Illllll llllllllllllhllllh lllllllllhlllll 
Db 337 HHNPGASAQPCCVPQALEPLPILYYVGRQHKVEQLSNMI VKSCKCS 382 



RESULT 15 

Q8JHF5 

ID Q8JHF5 



PRELIMINARY; PRT; 379 AA . 



AC Q8JHF5; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel . 22, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygi i ; Teleostei; Euteleostei ; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygi i ; Percomorpha ,- Perci formes; Percoidei; 

OC Sparidae; Sparus. 

OX NCBI_TaxID=8175; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Tafalla C. , Aranguren R., Secombes C.J., Castrillo J.L., Novoa B . , 

RA Figueras A. ; 

RT "Molecular characterization of sea bream (Sparus aurata) transforming 

RT growth factor betal."; 

RL Fish and Shellfish Immunol. 0:0-0(2002). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF510084; AAN03842.1; -. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 2. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA. 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS 00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 379 AA; 43506 MW; C0C9D3D2FCA2 9C0E CRC64 ; 

Query Match 67.8%; Score 464.5; DB 13; Length 379; 

Best Local Similarity 74.5%; Pred. No. 1.6e-45; 

Matches 79; Conservative 12; Mismatches 14; Indels 1; Gaps 1; 
Qy 15 CFSSTEKNCCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYN 74 



Db 




Qy 



75 QHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 120 



Db 



334 HHNPGASAQPCCVPQALEPLP I LYYVGRQHKVEQLSNMI VKSCKCS 379 



Search completed: October 28, 2003, 09:12:24 
Job time : 10.6208 sees 



GenCore version 5.1.6 
Copyright (c) 19 93 - 2 0 03 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



October 27, 2003, 10:47:27 ; Search time 4668.52 Seconds 

(without alignments) 
10489.161 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



US-10-017-372E-12 
1197 

1 atggcgccttcggggctgcg gttcctgcaagtgcagctga 1197 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



2888711 seqs, 20454813386 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



5777422 



Database 



GenEmbl : * 

1 : gb _ba : * 

2 : gbjitg : * 

3 : gb_m : * 
4 : gb_om : * 
5 : gb_ov : * 
6: gb_pat:* 
7 : gb_ph : * 
8: gb_pl:* 

9 : gb_pr : * 
1 0 : gb_ro : * 
11: gb_StS:* 

12: gb_sy:* 
13: gb_un:* 
14: gb_yi:* 
1 5 : em_ba : * 
16: em_fun:* 
17: em_hum:* 
18: em_m:* 
1 9 : em_mu : * 
2 0 : em_om : * 
21: em__or:* 
22: em_ov:* 
23: em_pat:* 
24: em_ph:* 

25: em_pl:* 
26: em_ro:* 
27 : em StS : * 



28 


em un : * 


29 


em_yi : * 


30 


em_htg_hum : * 


31 


em_htg_inv: * 


32 


em_htg_other : * 


33 


em htg mus : * 


34 


em_htg_pln: * 


35 


em_htg_rod : * 


36 


em htg_mam : * 


37 


em_htg_vrt : * 


38 


em_sy : * 


39 


em_htgo_hum: * 


40 


em htgo mus : * 


41 


em htgo_other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed; 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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5 
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4 
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4 
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92 


9 
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6 
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8 


83 


5 
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4 
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7 
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2 


80 


7 
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4 


D0GTGFB1A 


L34956 Canine tran 


8 
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6 


79 


9 
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9 
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M16658 Simian tran 


9 
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6 


79 


9 
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9 


BC022242 
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10 
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6 


79 


9 
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9 


BC000125 


BC000125 Homo sapi 


11 
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6 


79 


9 
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9 
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12 
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6 


79 


9 
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6 


E03028 
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13 
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6 


79 


9 
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14 


955 


79 


8 
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6 
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E00973 cDNA encodi 


15 


954 


6 


79 


7 


1173 


9 


BT007245 


BT007245 Homo sapi 


16 


954 


6 


79 


7 


1173 


12 


BT007866 


BT007866 Synthetic 


17 


952 


4 


79 


6 


1560 


6 


106216 


106216 Sequence 2 


18 


952 


4 


79 


6 


1560 


6 


108268 


108268 Sequence 2 


19 


945 


4 


79 


0 


1173 


4 


ECRGFB1 


X99438 E.caballus 


20 


945 


2 


79 


0 


1176 


6 


AX615127 


AX615127 Sequence 


21 


943 


6 


78 


8 


1176 


6 


AX481432 


AX481432 Sequence 


22 


943 


6 


78 


8 


1176 


6 


AX615128 


AX615128 Sequence 


23 


943 


6 


78 


8 


2745 


9 


HSTGFB1 


X02812 Human mRNA 


24 


939 


78 


4 


1187 


4 


AF175709 


AF175709 Equus cah 


25 


936 


2 


78 


2 


1569 


6 


106221 


106221 Sequence 3 


26 


929 


4 


77 


6 


1561 


6 


108275 


108275 Sequence 3 


27 


889 


4 


74 


3 


1597 


10 


AF191297 


AF191297 Cavia por 


28 


854 


2 


71 


4 


1585 


10 


RNTGFB1 


X524 98 Rat mRNA fo 


29 


847 


8 


70 


8 


1579 


10 


MUSTGFRNA 


M13177 Mouse trans 


30 


847 


.8 


70 


8 


1641 


10 


BC013738 


BC013738 Mus muscu 


31 


847 


8 


70 


8 


2094 


10 


MMU009862 


AJ009862 Mus muscu 


32 


805 


6 


67 


3 


1117 


4 


BOVTGFB 


M36271 Bovine tran 


33 


804 


6 


67 


2 


1125 


10 


AF480858 


AF4 8 0858 Sigmodon 



34 


682 


57 


. 0 


1675 


6 


103310 


103310 Sequence 1 


35 


654 


54 


. 6 


1376 


6 


AX528533 


AX528533 Sequence 


36 


652 . 4 


54 


. 5 


1389 


6 


AX528619 


AX528619 Sequence 


37 


591 . 4 


49 


. 4 


1352 


6 


AX528535 


AX528535 Sequence 


38 


586. 6 


49 


. 0 


1350 


6 


AX528615 


AX528615 Sequence 


39 


570. 6 


47 


. 7 


699 


6 


105434 


105434 Sequence 4 


40 


342 .4 


28 


.6 


489 


6 


AX455100 


AX455100 Sequence 


41 


342 


28 


.6 


1256 


5 


CHKTGFB4 


M31160 Gallus gall 


42 


340.4 


28 . 


.4 


469 


10 


MATGFB1 


X60296 M.auratus m 


43 


308 


25 


. 7 


374 


4 


AF349538 


AF349538 Canis fam 


44 


302 . 8 


25 . 


.3 


148841 


9 


AC011462 


AC011462 Homo sapi 


45 


301.2 


25 


.2 


2381 


9 


HSTGFBG1 


X05839 Human trans 



ALIGNMENTS 



RESULT 1 
AX338213 

LOCUS AX338213 1326 bp DNA linear PAT 09-JAN-2002 

DEFINITION Sequence 1 from Patent WO0181404. 
ACCESSION AX338213 

VERSION AX338213.1 GI: 18128750 

KEYWORDS 

SOURCE Sus scrofa (pig) 

ORGANISM Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Cetartiodactyla / Suina; Suidae; Sus. 
REFERENCE 1 

AUTHORS Strober,W., Nakamura,K., Kitani,A. and Fuss, I. J. 

TITLE Inducible plasmid vector encoding tgf-_g(b) and uses thereof 

JOURNAL Patent: WO 0181404-A 1 01-NOV-2001; 

THE SECRETARY OF THE DEPARTMENT OF HEALTH AND HUMAN SERVICES (US) 
FEATURES Locat ion/Qual l f lers 

source 1. .1326 

/orgamsm="Sus scrofa" 
/mol_type= "genomic DNA" 
/db_xref ="taxon: 9823" 
CDS 16. .1188 

/not e= "unnamed protein product" 
/codon_start=l 
/protein__id="CAD20538 .1" 
/db__xref = "GI : 18128751" 

/ translation="MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAOHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 

BASE COUNT 263 a 438 c 392 g 233 t 

ORIGIN 



Query Match 94 . 9%; 

Best Local Similarity 97.8%; 
Matches 1171; Conservative 



Score 1135.8; DB 6; Length 1326; 
Pred. No. 1.7e-198; 
0; Mismatches 2; Indels 24; Gaps 1; 



Qy 


1 


ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

1 1 1 1 1 II ! 1 1 II M: II 1 M 1 II 1 II I 1 '1 1 M 1 1 1 1 II III II II 1 1 1 II 1 1 1 1 1 1 II 1 

ATGG CGC CTTCGGGG CTGCGG CT CTTG C CGCTG CTG CTG C CG CTG CTGTGG CTG CTAGTG 


60 


Db 


16 


75 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 

1 1 1 1 1 II 1 1 1 1 1 1 1 1 , 1 1 II 1 '1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II !''l 1 1 1 1 1 1 1 1 1 1 1 1 II 1 III 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Db 


76 


135 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 


180 


Db 


136 


IIMMIIMIIIIIIIIMMMMIIIIIIIIMIMIIIMIIIMIIIIMI III 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 


195 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

IIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIII llllllllllll 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 


240 


Db 


196 


255 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

M M MM II II II Ml III 1 Ml IM 1 M I M 1 M 1 III 1 II III 1 1 II 1 1 1 1 1 1 1 1 1 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


300 


Db 


256 


315 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


360 


Db 


316 


1 1 1 II 1 1 1 1 1 M M 1 1 M 1 1 II 1 M ! 1 II 1 M M M 1 M II II 1 1 ! 1 1 M 1 1 1 1 1 1 M 1 1 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


375 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

1 1 M II II 1 1 IMM II III Ml 1 II MMM! i M 1 II 1 II II III MM 1 M II 1 1 1 1 

TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 


Db 


376 


435 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


480 


Db 


436 


1 M Ml M M II 1 1 M 1 M 1 Ml M 1 1 II II 1 M II 'M M M MM II II 1 M 1 MM 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


495 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 


Db 


496 


M Ml II M MMM MMM M III MM M M 1 MM IM M 1 MM 1 M III M M MM 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


555 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

1 IM 1 II M II II 1 M II II 1 1 II II Mill MM Ml II Ml 1 III 1 II III M II 1 1 M 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


600 


Db 


556 


615 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

1 M 1 1 1 II 1 1 1 1 1 II 1 1 1 1! M 1 1 IM 1 Ml 1 M 1 1 1 MMM MM II 1 II M 1 II 1 1' 

ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


660 


Db 


616 


675 


Qy 


661 


GCCC^CTCTTCCTCTGA(^GCAAAGATAAC^CACTCC^CGTGGAAATTAACGGGTTCAAT 

IMIMIIIMlllllililllllMIIIMM IIIIIIIIIIIIIIIMIIIIIMII 

GCCCACTCTTCCTCTGAC^GCAAAGATAAC^CACTCCACGTGGAAATTAACGGGTTCAAT 


720 


Db 


676 


735 


Qy 


7z 1 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 


780 


Db 


736 


III Mill Ml M ill M MIMM MIMi Ml 1 MMMIMMl IMMMIIIMI 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 


795 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 


840 


Db 


796 


IIIIIIIIIIIMMIIIIIIIIMMMIIIIIMIMIIIIIMIIIMMIIIIMI 

ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 


855 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 


900 



i; 1 1 1 1 Ml 1 1! 1 1 1 1 1 i 1 1 1 1 M 1 1 IMIIIIIII 

Db 856 GATACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 8 91 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

1 : 1 1 1 Mill II 1 1 1 1 1 1 ! Ml 1 1 1 1 . 1 , 1 1 1 M 1 1 1 1 1 III III II I Ml i II 1 1 1 1 1 1 

Db 8 92 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 951 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

1 1 1 1 1 j i f 1 1 f 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 i j 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 

Db 9 52 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1011 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1012 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1071 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

I I M I MIIIIMM 1 1 Ml I M I M II MM 1 1 1 1 1 1 M Ml II II M III I M I II 

Db 1072 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1131 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

I I I M III 1 1 1 Ml M 1 1 1 1 1 1 IM III 1 1 M 1 1 M II I M 1 1 1 1 1 1 1 1 1 1 II I M 

Db 1132 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 118 8 



RESULT 2 
AF461808 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 
CDS 



AF461808 2221 bp mRNA linear MAM 03-JAN-2002 

Sus scrofa transforming growth factor beta 1 (TGFB1) mRNA, complete 
cds . 

AF461808 

AF461808.1 GI:18042250 

Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetartiodactyla ; Suina; Suidae; Sus. 

1 (bases 1 to 2221) 

Wimmers,K., Chomde j , S . , Ponsuksil i , S . and Schellander, K. 
Polymorphism in the porcine transforming growth factor beta 1 gene 
Unpublished 

2 (bases 1 to 2221) 

Wimmers,K., Chomde j , S . , Ponsuksil i , S . and Schel lander , K . 
Direct Submission 

Submitted (20-DEC-2001) Institute of Animal Breeding Science, 
University of Bonn, Endenicher Allee 15, Bonn 53115, Germany 

Location/Qual if iers 

1. .2221 

/organi sm= " Sus scrofa " 

/ mo 1 _t yp e = " mRNA " 

/db_xref ="taxon: 9823" 

/chromosome^ " 6 " 

1. .2221 

/gene=" TGFB1 " 

1. .1173 

/ gene= " TGFB1 " 

/note=" cytokine" 

/codon start=l 



/product^" transforming growth factor beta 1" 
/protein_id="AAL57902 . 1" 
/db_xref="GI : 18042251" 

/ 1 rans 1 at ion* M MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
3'UTR 1174. .2221 

/ gene= "TGFB1 " 

BASE COUNT 504 a 616 c 649 g 452 t 

ORIGIN 

Query Match 94.5%; Score 1131; DB 4; Length 2221; 

Best Local Similarity 97.6%; Pred . No. 1.3e-197; 



Matches 


1168; Conservative 0; Mismatches 5; Indels 24; Gaps 


i 


Qy 


1 


ATGG CG C CTTCGGGG CTG CGG CTCTTG CCG CTG CTG CTG C CG CTG CTGTGG CTG CTAGTG 

III 1 1 1 1 1 i II 1 1 1 1 1 1 1 II 1 1 II M 1 1 .'.'1 1 1 1 1 1 M 1 1 II 1 1 1 II 1 1 Ml 1 i 1 1 II 

ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 


60 


Db 


1 


60 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Db 


61 


IIMIIM Mill MIIMIMIMIMMIMM! IIMMIM! Ml IIIIIIIMM 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

1 1 1 M 1 M M 1 1 1 1 1 1 1 1 1 MM 1 1 M II II III 1 1 M 1 II 1 III 1 1 II 1 1 ! II 1 1 III 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 


180 


Db 


121 


180 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

1 1 1 1 1 M 1 1 M M 1 1 MMI Ml 1 III M 1 1 Ml 1 II II II II II M 1 MINIMUM 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 


24 0 


Db 


181 


240 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

II II 1 1 II 1 1 1 1 M MM II II II II M 1 1 II II II M 1 1 1 II 1 1 M 1 1 M 1 M M II 1 1 1 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


300 


Db 


241 


300 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

1 II 1 1 1 II II 1 M II MM MM II 1 1 II II M II Ml 1 MM III MM 1 1 1 MM II Ml 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


360 


Db 


301 


360 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 


Db 


361 


IM 1 II M M M 1 1 II 1 II IM II M M II MM II II MM Ml M II 1 M 1 1 1 1 M 1 1 1 

TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCOCrTnrTrJAGGCTC 

II 1 Ml 1 1 1 III MM M llllll 1 1 III 1 MM, III MM III IMM II IM III M 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


480 


Db 


421 


480 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 


Db 


481 


M MM MMII 1 1 II 'illlMIM 1 1 1 II M Ml MM 1 M IMM 1 MM 1 MM MM'I 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

M 1 II II 1 M M M III 1 1 1 M III M 1 1 1 1 II 1 II 1 1 1 1 II MMI 1 1 1 II 1 MM 1! M 


600 



Db 



541 



TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 



600 



Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


877 


Qy 


961 


Db 


937 


Qy 


1021 


Db 


997 


Qy 


1081 


Db 


1057 


Qy 


1141 


Db 


1117 



601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 66 0 



ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 66 0 



GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

Mlllll Mill M I !l 1 1 1 1 1 1 1 'I III I II I 'i 1 1 II 1 1 1 II I II 1 1 1 1 1 1 II ll I 

GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 1 1 1 1 1 1 II I II 1 , 1 1 1 1 1 1 1 1 1 1 1 h 1 1 II 1 1 1 1 II I II I M I M 1 1 1 < 1 1 1 M 1 1 1 1 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 



720 



720 



780 



780 



II 



MIIIIIIIMIIIIMIMIMIII 



GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

1 1 1 1 1 1 1 h I M I MM 1 1 1 1 1 1 1 1 1 MINIUM 

GATACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 



900 



876 



960 



936 



GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

M M I M I M M M M M M M I M M M M M M I M M M I M M M M M M M M I 

GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 996 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1056 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1116 
AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

I M f 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1173 



RESULT 3 

PIGTGFB1A 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



PIGTGFB1A 3206 bp mRNA linear 

Sus scrofa transforming growth factor beta-1 mRNA, 
M23703 

M23703 . 1 GI :755044 
transforming growth f actor-beta-1 . 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; 
1 (bases 1 to 3206) 

Kondaiah,P., Van Obberghen- Schilling, E . , Ludwig,R.L 
Sporn,M.B. and Roberts, A. B. 



MAM 31-MAR-1995 
complete cds . 



Euteleostomi ; 
Sus . 

, Dhar,R., 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



gene 
CDS 



polyA_signal 
polyA site 



cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

Evidence for alternate splicing and polyadenylat ion 

J. Biol. Chem. 263 (34), 18313-18317 (1988) 

89054010 

2461367 

On Apr 1, 1995 this sequence version replaced gi: 341017. 

Original source text: Sus scrofa (strain miniature swine) cDNA to 

mRNA . 

Location/Qualifiers 
1. .3206 

/organism= "Sus scrofa" 
/ mo 1 _ t yp e = " mRNA " 
/strain="miniature swine" 
/db_xref="taxon: 9823" 

/cell_type= "peripheral blood lymphocyte" 
1. .3206 

/gene= " TGF -bet a - 1 " 
906. .2078 
/gene= "TGF-beta- 1 " 
/codon_start=l 

/product =" trans forming growth f actor-beta-1 " 
/protem_id="AAA64616 . 1" 
/db_xref="GI : 755045" 

/ translation "MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVTiALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS " 
3186. .3191 



BASE COUNT 
ORIGIN 



/gene="TGF-beta-l" 
3206 

/gene= "TGF-beta - 1 " 
645 a 1041 c 924 g 



596 t 



Query Match 94 . 5%; 

Best Local Similarity 97.6%; 
Matches 1168; Conservative 



Score 1131; DB 4; Length 32 06; 
Pred. No. 1.2e-197 ; 
0; Mismatches 5; Indels 24; 



Gaps 



l; 



Qy 



Db 



1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III 1 1 1 1 1 1 1 1 1 , 1 1 1 1 1 Mi I II 1 1 1 1 1 1 , 1 1 1 1 : II I Ml I II MM I I I || I II I I 

906 ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 965 



Qy 

Db 

Qy 

Db 

Qy 

Db 



61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

I M i M I MM M 1 M 1 1 M I IMM 1 1 M> 1 1 MM I M i Ml MIM M 1 1 MM 1 1 MMM 1 1 i 

966 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 1025 



121 



180 



GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

MM M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 III M 1 1 1 1 1 1 MM I M iMI I Ml 1 1 1 1 1 1 1 1 1 III 

102 6 GTGAAG CGGAAG CG CAT CGAGG C CATT CG CGG CCAGATTCTGTCCAAG CTTCGG CTCG CC 1085 
181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

MIMI I M 1 1 1 M 1 1 M I IM I II 1 1 1 1 1 M II M 1 1 MM 1 1 III IMIIMIMM 

108 6 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 114 5 



Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

IIIIIIIIIIIIMIIIIIMIIIMMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 

Db 114 6 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 12 05 

Qy 3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

I I M I'.'! II 1 1 1 1, 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 , ! l II I h 1 1 1 1 II I 'I! Ill 1 1 II I 

Db 12 06 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 12 65 

Qy 3 61 T ATG AT AAATT CAAGGG CA CCCCCCACAG CTTATATATG CTGTT CAA CA CGT CG GAG CTC 42 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIII 

Db 12 66 TATGATAAATTC^AGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 132 5 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 48 0 

IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 132 6 CGGG AAG CGGTG C CGG AA C CTGTATTG CT CT CT CGGG CAGAG CTG CG CCTG CTGAGG CTC 13 8 5 

Qy 481 AAGTTAAAAGTGG AG CAG CA CGTGG AG CTATAC CAGAAATACAG CAATGATT CCTGG CG C 54 0 

IIIIIIIIIMIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIMIIIIIIIIII 

Db 13 86 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 144 5 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

i 1 1 i 1 ' : i j i i i i 1 , i i i . : , i i i i , 1 i 1 i , 1 i i i 1 i ' i . i 1 i' : 1 i 1 i i : i i i ; ) i i j 

Db 144 6 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 15 05 

Qy 6 01 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 

Db 15 06 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 1565 

Qy 661 GCCC^CTCTTCCTCTGAC^GCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

Illllll Mill IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIMM 

Db 1566 GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGG 1625 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I I I m 1 1 1 1 ill i;i i ii 1 1 1 1 ii i ii i 1 1 1 1 1 1 , :i i ii i;i i 1 1 ii m 1 1 1 1 1 1 1 :n 

Db 1626 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 168 5 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

MMIIMMIIIIMIMIIIIIIMIIIIIMIMIIIIMIIIMIIMIIIIMM 

Db 168 6 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 174 5 

Qy 841 GATACCAA CTACTG CTTCAG CTC CACGGACTACAAGGATGA CGACGA CAAGGAGAAGAAC 9 00 

; i : 1 1 : n 1 1 1 1 1 'i 1 1 1 1 1 , 1 1 1 1 iiiiiiiiii 

Db 174 6 GATACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 1781 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 96 0 

m 1 1 1 u 1 1 ii 1 1 1 1 ill 1 1 1 ! 1 1 1 ii; 'i ii 1 1 1 1 1 1 ii i ii i ii i ii 1 1 1 ill 1 1 ill 

Db 1782 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 1841 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

M 1 1 1 M M 1 1 1 II I M i I ( I II II 1 1 ! I M M M I II I II II M I II 1 1 M M M M M 

Db 184 2 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1901 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

M I II 1 1 1 INI 1 1 1 III 1 1 1! 1 1 1 1: IN II 1 1 1 1 1 1 II I ii I II I II I III Mil III 

Db 19 02 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1961 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 



1 1 111 1 ii i ii 1 1 1 1 ii 1 1 1 1 1 1 1 1 'i 1 ii 1 1 1 1 1 1 1 1 ih 1 1 1 1 1 m ■ i: 1 1 : 1 1 1 : i 

Db 1962 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 2021 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 1 Mi 1 1 1 1 M Ml 1 1 1 1 , 1 1 1 1 1, , 1 1 1 1! 1 1 1 1 1 1 II, 1 1 1 1 II, I II 1 1 1 1 1 1 

Db 2 022 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 2078 



RESULT 4 

SSTGFBR 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
PUBMED 
FEATURES 

source 



CDS 



mat_peptide 



SSTGFBR 1605 bp mRNA linear MAM 27 -MAR- 1995 

Porcine mRNA for transforming growth factor-beta (TFG) precursor. 
Y00111 

Y00111.1 GI:2129 
transforming growth factor-beta. 
Sus scrofa (pig) 
Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 
1 (bases 1 to 1605) 
Derynck,R. and Rhee,L. 

Sequence of the porcine transforming growth factor-beta precursor 

Nucleic Acids Res. 15 (7), 3187 (1987) 

87174844 

3470708 

Location/ Qual i f iers 
1. .1605 

/organism= M Sus scrofa" 
/ mo 1 _ t yp e = " mRNA " 
/db_xref="taxon: 9823" 
/t issue_type= " ovary" 
/clone_lib=" lambda gtlO" 
404. .1576 

/note="TFG-beta precursor (AA 1-390)" 

/codon_start=l 

/protein_id="CAA68291. 1" 

/db_xref="GI :2130" 

/ db_xr e f = " SWISS- PROT : P 0 7 2 0 0 " 

/translation= " MP PSGLRLLPLLLPLLWLL VLTPGR PAAGLSTCKT I DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMLESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFR 
LSAHCSCDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS " 
1238. .1573 



misc feature 



misc feature 



BASE COUNT 
ORIGIN 



/product="mature TFG-beta" 
647. .655 

/note="pot. N-glycosylation site" 
809. .817 
/note="pot. N-glycosylation site" 
298 a 572 c 437 g 298 t 



Query Match 94.4%; Score 1129.4; DB 4; Length 1605; 

Best Local Similarity 97.5%; Pred. No. 2.6e-197 ; 

Matches 1167; Conservative 0; Mismatches 6; Indels 24; Gaps 1; 



Qy 


l 


Db 


404 


Qy 


61 


Db 


464 


Qy 


121 


Db 


524 


Qy 


181 


Db 


584 


Qy 


241 


Db 


644 


Qy 


301 


Db 


704 


Qy 


361 


Db 


764 


Qy 


421 


Db 


824 


Qy 


481 


Db 


884 


Qy 


541 


Db 


944 


Qy 


601 


Db 


1004 


Qy 


661 


Db 


1064 


Qy 


721 


Db 


1124 


Qy 


781 


Db 


1184 



ATGG CG CCTTCGGGGCTG CGG CTCTTG CCGCTG CTGCTG CCG CTG CTGTGGCTG CTAGTG 60 

III I lll'l Ml! 1111:11, Ml I III MM hi I Mill INI III III MMI III 

ATGCCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 463 



1 1 1 1 1 1 1 1 , 1 1 !! 1 1 M 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 1 1 i I < 1 1 1 ! 1 1 1 . 1 1 1 . 1 1 , 1 1 1 1 1 1 1 



GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

II II 1 1 1 M II 1 1 1 II M I M I II 1 1 Ml II I II I M II Ml I M I M I II M M 1 1 1 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 583 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

M M 1 1 ! II II II II Ml 1 1 1 1 1 II 1 1 1 1 M M III I MMM Ml I llllllllilll 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 64 3 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 30 0 

! 1 1 1 1 1 1 1 [ 1 1 1 ' I i 1 1 1 1 1 1 1 1 ! 1 1 1 !. 1 1 1 ! 1 1 1 j 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 : 1 1 ! 1 1 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 7 03 



I M I II II I MMM M MM I II 1 1 II M I M I II I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

M II MIMMIIMIMIM IM II MMMIMI MMM MM III Ml IM IM II 

TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 823 



M II I III II MM Ml MM I II 1 1 1 II 1 1 II I II I III II I II II II III M I III II 



AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

II I M I II M Ml 1 1 II I M I M I II M MM 1 1 M M MM 1 1 1 1 M MM 1 1 1 1 II I II 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 943 



I II M MM II M M IM MM MM 1 1 II M M I M 1 1 MM' I IM II 1 1 II 1 1 MMM 



ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

M M I III M I III MM I M I II I M II 1 1 1 II IM Ml IM II I M IM MM MM II 

ACCGGAGTTGTGGGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 1063 



MINI Mill III MM II 1 1 1 1 1 1 1 1 1 1 1 M M IMM I M I M 1 1 1 II 



TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

1 1 M I III 1 1 1 II M I II MM II II I III M II M 1 1 M IM I M 1 1 1 1 1 1 M Ml I M 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 1183 



M 1 1 IMI M MM III MM MMI 1 1 II I M I M 1 1 MM I IM 1 1 1 1 1 1 1 1 1 M 



Qy 841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

1 1 1 1 1 1 1 hi 1 1 1 1 i 1 1 1 II 1 1 1 1 1 1 llllllllll 

Db 1244 GATACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 127 9 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

1 1 1 II 1 1 1 [ II I II 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 M I II II 1 1 1 

Db 12 8 0 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 1339 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

1 1 1 1 1 1 1 1 : i 1 1 1 1 : i 1 1 1 M 1 1 1 1 1 1. 1 ii 1 1 1 1 1 1 1 1 : 'i i ii 'M ii i !i 1 1 ii i:i 

Db 134 0 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1399 

Qy 1021 GAC^CTC^GTAC^GC^GGTCCTGGCTCTGTAa^CCAGCACAACCCGGGCGCGTCGGCG 108 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 14 00 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 14 59 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

1 1 1 1 II 1 1 II 1 1 1 i M 1 1 1 II , II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 i III II Ml II M'l 1 1 1 1 M 

Db 14 60 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCAGTGCCCATCGTGTACTACGTGGGCCGC 1519 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

i ';i i m :.i i in i m 1 1 1, 1 1 1 ii i . i ii ii i i 1 1 1 1 1 1 1 1 1 m ii i ii 1 1 1 1 

Db 152 0 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1576 



RESULT 5 

GGTGFB1 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



GGTGFB1 1750 bp mRNA linear MAM 27-MAR-1996 

Porcine mRNA for transforming growth factor-beta 1. 

X12373 

X12373.1 GI:63808 

transforming growth factor-beta 1. 

Sus scrofa (pig) 

Sus scrofa 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Cetart iodactyla ; Suina ; Suidae; Sus. 

1 (bases 1 to 1750) 

Jakowlew, S .B . , Dillard, P . J . , Sporn,M.B. and Roberts, A. B. 
Nucleotide sequence of chicken transforming growth factor-beta 1 
(TGF-beta 1) 

Nucleic Acids Res. 16 (17), 8730 (1988) 

88335639 

3166520 

2 (bases 1 to 1750) 
Jakowlew, S . B . 
Direct Submission 

Submitted ( 14 - JUL-1988 ) Jakowlew S.B., National Institute of 
health, National Cancer Institute, Laboratory of Chemoprevention . 
Building 41, Room B902, Bethesda, Maryland 20892, USA 
The submitters believe that the chicken cDNA library was 
contaminated with porcine cDNA, and that the sequence is infact 
porcine TGF-beta-1. 27-MAR-1996. 

Locat i on/Qua 1 if iers 

1. .1750 

/ organi sm= " Sus scrofa " 
/mol_type= "mRNA" 
/strain="white leghorn" 



/db_xref= M taxon: 9823" 

/clone="pTGFB-ChX119" 

/eel l__type = " chondrocyte" 
gene 1. .1750 

/gene="TGF-beta 1" 
5'UTR 1. .446 

/gene="TGF-beta 1" 
CDS 447. .1622 

/gene="TGF-beta 1" 

/codon_start=l 

/product= "transforming growth factor" 
/protein_id="CAA30933 . 1" 
/db_xref="GI : 63809" 
/db_xref = "SWISS-PROT: P07200" 

/ 1 rans 1 a t ion= " MPPSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVIALYNSTRDRVAGESVEPEPEPEAD 
YYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNDSWGYLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAT EGFR 
LSAHCSCDSKDNTLHVEINAGFNSGRRGDIuATIHGMNRPFLLLMATPLERAQHLHSSR 
HRRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 
TQYSKVIALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
mat_peptide 1285. .1619 

/gene="TGF-beta 1" 
3'UTR 1620. .1750 

/gene="TGF-beta 1" 

BASE COUNT 325 a 627 c 479 g 319 t 

ORIGIN 



Query Match 92.9%; Score 1111.6; DB 4; Length 1750; 

Best Local Similarity 97.0%; Pred. No. 4.7e-194; 

Matches 1164; Conservative 0; Mismatches 9; Indels 27; Gaps 2; 



Qy 


i 


ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

III MINIMUM 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 II 1 i 1 1 1 1 1 1 

ATGCCGCCTTCGGGGCCTGGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 


60 


Db 


447 


506 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Db 


507 


MMMMMMMMMMMMMIMMMMMMMMMMMMMMMMI 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


566 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1! II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 Ml 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 


180 


Db 


567 


626 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

IIIMIIIIIIIIIIIIIMIIIIIIIIIIIMMIIMIIIIIIII IIIIIIIMIII 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 


240 


Db 


627 


686 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


300 


Db 


687 


IMIMIIIMMMMMIIMMMIIMMIIMIIIMMIMIIIMIIMIIII 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


746 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

IMMIIIIMIIIMIIMIIIIMMMMMIIIMIIMIIMMIIMMIIIII 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


360 


Db 


747 


806 



Qy 



361 



TATGATAAATT(^GGG(^CCCCC(^CAGCTTATATATGCTGTTCAACACGTCGGAGCTC 



420 



Db 


807 


Qy 


421 


Db 


867 


Qy 


481 


Db 


927 


Qy 


541 


Db 


987 


Qy 


601 


Db 


1047 


Qy 


661 


Db 


1107 


Qy 


718 


Db 


1167 


Qy 


778 


Db 


1227 


Qy 


838 


Db 


1287 


Qy 


898 


Db 


1323 


Qy 


958 


Db 


1383 


Qy 


1018 


Db 


1443 


Qy 


1078 


Db 


1503 


Qy 


1138 


Db 


1563 



IIIIMMIIMIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIMI 



CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 80 

MIIIIIIIIIIIIMIIIIMMIIIIIIIIMIIIIIIIIMIIIIIIMMIIIIII 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 926 

481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

llllllllilMIIIIIIIIIIIIIMIIIIillllillllilllllllllllllll II 
92 7 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGGGC 986 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 104 6 

ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

IMIIIIMIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIMIIIIIIIIMIIIIII 

ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 1106 
661 GCCCACTCTTCCTCTGAC^GCAAAGATAACACACTCCACGTGGAAATTAAC GGGTTC 717 

III!!! Mill II I ! , I lllll ! ! ! I ! Ill ! ! ! ! ! Ml II! I ! ' !M Mllll 

1107 GCCCACTGTTCCTGTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGCAGGGTTC 1166 
AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 777 

MMMMMMMIMMMMMMMMMMMMMMMMMMMMMMI 

AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 1226 
CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 837 

MMMMMMMMMM MMMMMMMM MMMMMMMMMMMM 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 1286 
8 3 8 CTGGATA C CAA CTACTG CTTCAG CT CCACGGACTACAAGGATGA CGA CGA CAAGGAGAAG 8 97 

M M I II 1 1 M I M 1 1 1 M II I M M I M lllllll 

128 7 CTGGATACCAACTACTGCTTCAGCTCCAC GGAGAAG 1322 

AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 957 

I M 1 1 1 1 1 1 1 1 1 1 1! 1 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 M I M 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 

AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 1382 
CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1017 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1442 
CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1077 

IMMIIMMIMMIMMMMIIIIMIIIIIIIIIMMIIIIMIIIIIMIII 

CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1502 
GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1137 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! I 

G CGG CG C CGTG CTG CGTG C CG CAGG CG CTGG AG C CACTG CC CAT CG TG T ACT ACGTGGG C 1562 
CG CAAG CC CAAGGTGGAG CAG CTGTC CAACATGAT CGTG CGTTCCTG CAAGTG CAG CTGA 1197 

I I M M 1 1 1 1 1 1 1 M M 1 1 ) 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 1 1 1 M 1! 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 

CGCAAGCC(^GGTGGAGC^GCTGTC(^CATGATCGTGCGTTCCTG(^GTGaVGCTGA 1622 



RESULT 



6 



0ATGFB1 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



Chordata; Craniata,- Vertebrata; Euteleostomi ; 
Cetartiodactyla; Ruminant ia; Pecora; Bovoidea; 
Ovis . 



mat__peptide 



BASE COUNT 
ORIGIN 



OATGFB1 1173 bp mRNA linear MAM 18-APR-1995 

O.aries mRNA for transforming growth factor-beta I. 

X76916 

X76916. 1 GI :496648 

TGF-beta 1; transforming growth factor-beta 1. 
Ovis aries (sheep) 
Ovis aries 
Eukaryota ; Metazoa ; 
Mammalia; Eutheria; 
Bovidae; Caprinae; 
1 

Woodall ,C. J. , McLaren,L.J. and Watt, N.J. 

Sequence and chromosomal localisation of the gene encoding ovine 
latent transforming growth factor-beta 1 
Gene 150 (2), 371-373 (1994) 
95121932 
7821809 

2 (bases 1 to 1173) 
Woodall, C. 
Direct Submission 
Submitted (24 -DEC- 1993 ) C. 
Veterinary Pathology, Sc. 
Edinburgh EH9 IQH, UK 

Locat ion/Qual i f iers 

1. .1173 

/organismV'Ovis aries" 
/ mo 1 _ t yp e = " m R N A " 
/dbjcref = H taxon: 994 0" 
1. .1173 
/codon_start=l 

/product="transforming growth factor-beta 1" 
/protein_id="CAA54242 . 1" 
/db_xref= M GI : 4 9664 9" 
/db_xref="SWISS-PROT: P50414" 

/ 1 rans 1 a t i on= " MP PSGLRLLPLLLPLLWLLMLTPGR P VAGLSTCKTI DMELVKRK 
GIEAIRGQILSKLRLASPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEAD 
Y YAKE VTRVLMVEYGNKI YDKMKSSSHSI YMFFNTS ELREAVP E P VLLSRADVRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFR 
LSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
835. .1170 

/product=" transforming growth factor-beta 1" 
378 c 336 g 214 t 



Woodall, Univ. of Edinburgh, Dept. of 
of Vet. Studies, Univ. of Edinburgh, 



245 a 



Query Match 83.5%; Score 999.8; DB 4; Length 1173; 

Best Local Similarity 90.7% ; Pred. No. 1.6e-173; 

Matches 1086; Conservative 0; Mismatches 87 ; Indels 24; Gaps 1; 

Qy 1 ATGGCG CCTTCGGGG CTG CGG CTCTTG C CG CTG CTG CTG C CG CTG CTGTGG CTG CTAGTG 60 

III lilllllllllllllllll IIIIIIIIIIIIIIIMIIIIIIIIIII III II 
Db 1 ATG CCG C CTTCGGGG CTG CGG CTG CTG C CG CTG CTG CTG C CG CTG CTGTGG CTACTAATG 60 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

IMIIMIMIIIIIIIII 1 1 '1 1 1 1 1 1 r 1 1 1 I 1 MM Ml I III II II I. 1 'II 



Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


Db 


661 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 


Qy 


841 


Db 


841 


Qy 


901 


Db 


877 



61 CTGACGCCTGGCCGGCCGGTCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 
'GAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

MINIMI! IMIIIMIMM Mill IIMM IIIIMI MINIM III 



AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

1 1 1 1 1 1 1 1 1 ! II 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 Ml 

AGTCCCCCGAGCCAGGGGGACGTGCCACCCGGCCCGCTGCCCGAGGCCATACTGGCCCTT 24 0 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 : ! 1 1 1 1 1 1 ! I MM IIIIMI MINIMI 

TACAACAGTACCCGCGACCGGGTGGCCGGGGAAAGTGCCGAAACGGAGCCTGAGCCAGAG 3 00 



M M M 1 1 M 1 1 M 1 1 1 M M i M 1 1 1 M M I 



TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

INN IN INI III MINI i 1 1 1 1 1 1 ! I 1 1 1 1 1 1 M I M 1 1 1 1 1 1 

TATGACAAAATGAAGTCTAGCTCGCACAGCATATATATGTTCTTCAACACGTCCGAGCTC 4 20 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

IIIINII INN 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M ! 1 1 1 1 1 

CGGGAAGCAGTGCCTGAACCTGTGTTGCTCTCTCGGGCAGACGTGCGCCTGCTGAGGCTC 4 8 0 



1 1 II M I II M II M 1 1 M II I M I I M I II N I II INN MIMINNI 



TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

1 1 II 1 1 M III I, U 1 1 1 1 IIIIMI! Ml I , I ' 1 1 1 , 1 1 1 1 ! I M 1 1 1 1, 1 1 IN 

TACCTCAGCAACCGGCTGCTCGCCCCCAGCGACTCACCGGAGTGGCTGTCCTTTGACGTC 60 0 

ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 66 0 

II I I I I I I II II I I I I I I I I I I I! I I I II. I' INN II NINNINN 
ACTGGAGTTGTGCGGCAGTGGCTGACCCACAGAGAGGAAATAGAAGGCTTTCGCCTCAGT 66 0 



NNIN INN WWW II 'I'll, 1 1 N INN II IIMNINI I 



TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

II M 1 1 1 II 1 1 1 1 1 1 1 1 1 III MINIUM MINI llMlllliMI MINIM 

TCCGGCCGCCGGGGTGACCTCGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 



Nil 



GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

II I M I I I I II I I I I I I I M I I I I I II WWW 

GACACCAACTACTGCTTCAGCTCCAC AGAAAAGAAC 876 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

INN II II I I I I I I I I ,M I I II I II II I 1 III I I I 'l I I II I I, I II I Ml I II 
TGCTGTGTTCGTCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAC 93 6 



Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

' 1 1 1 1 1 1 1 1 1 1 ': 1 1 1 1 1 M i ii i m 1 1 1 1 1 1 1 1 m i ii ii 1 1 1 ii 1 1 1 1 1 1 1 1 M 

Db 937 GAACCCAAGGGCTACCACGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTG 996 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

Mill 1 1 1 1 1 1 1 1 1 II I II II 1 1 1 1 II II lllhllM Mll.'lUII I llllll 

Db 9 97 GACACACAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCATCGGCG 1056 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1140 

1 1 1 1 1 1 1 II 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1057 GCGCCGTGCTGCGTGCCTCAGGCGCTGGAACCCCTGCCCATCGTGTACTACGTGGGCCGC 1116 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

Db 1117 AAGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1173 



RESULT 7 
DOGTGFB1A 

LOCUS DOGTGFB1A 1369 bp mRNA linear MAM 3 0 -OCT- 1994 

DEFINITION Canine transforming growth factor-beta 1 (TGFB1) mRNA, complete 
cds . 

ACCESSION L34 95 6 

VERSION L34956.1 GI: 516071 

KEYWORDS homologue; transforming growth factor-beta 1. 
SOURCE Canis familiaris (dog) 

ORGANISM Canis familiaris 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 
REFERENCE 1 (bases 1 to 1369) 

AUTHORS Manning, A.M. , Auchampach, J . A . , Drong,R.F. and SI ightom, J . L . 
TITLE Cloning of a canine cDNA homologous to human transforming growth 

factor-beta 1 (TGFbetal) 
JOURNAL Unpublished (1994) 
COMMENT Original source text: Canis familiaris adult jugular vein 

endothelial cDNA to mRNA. 
FEATURES Locat ion/Qual i f iers 

source 1. .1369 

/organism="Cams familiaris" 
/ mo 1 __ t yp e = 11 mRNA " 
/db_xref="taxon: 9615" 
/eel l_type= "LPS -activated" 
/tissue__type=" jugular vein endothelial" 
/dev_s tage= "adul t 11 
gene 1. .1369 

/gene="TGFBl" 
5'UTR 1. .57 

/gene="TGFBl" 
CDS 58. .1230 

/gene="TGFBl" 

/ f unct ion=" ant i- inflammatory agent" 
/not e= "precursor" 
/codon__start = l 

/product="transf orming growth factor-beta 1" 
/protein_id="AAA51458 . 1" 
/db_xref ="GI : 516072" 



mat_peptide 



3 ' UTR 



BASE COUNT 
ORIGIN 



/translation= " MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRK 
RIEAIRGQILSKLRLSSPPSQGEVPPVPLPEAVIALYNSTRDRVAGESAEPEPEPEAD 
Y YAKEVTRVLMVENTNKI YEKVKKS PHS I YMLFNTSELREAVPEPVLLSRAELRLLRL 
KLKAEQHVELYQKYSNDSWRYLSNRLLAPSDTPEWLSFDVTGVVRQWLSHGGEVEGFR 
LSAHCSCDSKDNTLQVDINGFSSSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQ 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
Q YS KVLAL YNQHN PGAS AA P CC VPQALE PL P I VY Y VGR KP KVEQLSNM I VRS CKCS 11 
1039. .1227 
/ gene= " TGFB1 " 
/product =" trans forming growth factor-beta 1" 
1231. .1369 
/gene="TGFBl" 
264 a 473 c 415 g 216 t 1 others 



Query Match 80.7%; 
Best Local Similarity 89.0%; 
Matches 1065; Conservative 



Score 966 .2; DB 4; 
Pred. No. 2.3e-167; 
0; Mismatches 108; 



Length 1369; 
Indels 24; Gaps 



1; 



Qy 


l 


Db 


58 


Qy 


61 


Db 


118 


Qy 


121 


Db 


178 


Qy 


181 


Db 


238 


Qy 


241 


Db 


298 


Qy 


301 


Db 


358 


Qy 


361 


Db 


418 


Qy 


421 


Db 


478 


Qy 


481 


Db 


538 



ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III 1 1 1 1 II lllllllllll IIIIIIIIIIIIMIIMMIII 1 1 1 1 llllll 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTGCCGCTGCTGCGGCTACTAGTG 117 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

1 1 ll 1 1 1 1 II 1 1, 1 1 1 1 1 , i . 1 ' 1 1 1 1 ;i 1 1 , 1 1 1 1 1 IN || I Ml 1 1, |,i M Ml 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAACTG 177 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

MIIIMIMMIIIIIMMIMM lllllllllll lllllllllll Mill 1 1 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCTCC 237 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

IMIIIIIIIIIIIIIIMI llllllllll IIIIMII Mill II Mill II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGTGCCGCTGCCCGAGGCCGTGCTGGCCCTC 2 97 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

IMIIIM 1 1 II 1 1 1 1 II 1 1 1 1 II Mill M I III IMMMMIMM III 

TACAACAGCACCCGCGACCGGGTGGCGGGGGAGAGCGCCGAGCCGGAGCCCGAGCCCGAG 357 

GCGGACTACTACGCCAAGGAGGTCA.CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

1 1 II I II II 1 1 1 1 1 1 1 1 M M II 1 1 1 1 1 II II I II I M I II 1 1 1 1 1 I MM Mill 

GCGGACTACTACGC(^GGAGGTC^CCCGCGTGCTAATGGTGGAAAA(^CCAAC^AAATC 417 

TATGATAAATTC^GGGC^CCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

Mill iii mil i ii iiiiii Milium iMMiii ii mm 

TATGAGAAAGTC^GAAAAGTCCGC^CAGC^TATATATGCTCTTCAAC^CATCAGAGCTC 4 77 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

M Mill Mill II II II IMMIII MM I II II llllll llllll III II 

CGAGAAGCAGTGCCTGAGCCCGTCTTGCTCTCCCGGGCAGAGTTGCGCCTGCTGAGGCTC 537 



III 



Mill 



Qy 



541 



TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

IIIIIIIIIIIIIMIIIIMII Mill III I M II IIIIIIIIIIIIMIIM 



600 



Db 



598 



TACCTCAGCAACCGGCTGCTGGCGCCCAGCGACACGCCAGAATGGCTGTCCTTTGATGTC 



657 



Qy 


601 


Db 


658 


Qy 


661 


Db 


718 


Qy 


721 


Db 


778 


Qy 


781 


Db 


838 


Qy 


841 


Db 


898 


Qy 


901 


Db 


934 


Qy 


961 


Db 


994 


Qy 


1021 


Db 


1054 


Qy 


1081 


Db 


1114 


Qy 


1141 


Db 


1174 



601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

II Mill III MINIMUM II Ml 1 1 I Mill MIMMMIM 

658 ACTGGAGTCGTGAGGCAGTGGCTGAGCCATGGAGGGGAAGTCGAGGGCTTTCGCCTCAGT 717 



111,11 , Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II MMMMMMI 



I M I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M I II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



I M II 1 1 II 1 1 1 1 M 1 1 1 M 1 1 1 M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 



GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

II IMIIIIIIIIIIIIIIMIIII IMIIIIIII 

GACACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 933 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

1 1 1 1 1 1 , I If I f I ! 1 1 1 1 1 If 1 1 1 r 1 1 1! I Mill MMMMIIIMIMI Ml 

TGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGGGCTGGAAGTGGATCCAT 9 93 
GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

M MM, Mill II II MMMMIIIMIMI MMMM MMMM 

GAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCCCCTACATTTGGAGCCTG 1053 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill MIMIMMIMMIMM IIIIIIIIIIIIIIIIIIIIIIIIIMIIIMI 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1113 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

IMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIMIIIIIII 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1173 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

I II M II I II I I II II I II I II I IMMMMIMM f f I I I I I I I I I I I I I I I I 
AAGCCCAAGGTGGAGCAGCTGTCGAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 123 0 



RESULT 8 

AGMTGFB 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



AGMTGFB 1561 bp mRNA linear 

Simian transforming growth factor-beta (TGF) mRNA, 
M16658 

M16658 .1 GI : 176552 

growth factor; transforming growth factor-beta. 
Cercopithecus aethiops (African green monkey) 
Cercopithecus aethiops 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata 
Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 
Cercopithecinae; Cercopithecus. 
1 (bases 1 to 1561) 

Sharpies, K., Plowman , G . D . , Rose,T.M., Twardzik, D. R . and 



PRI 27-APR-1993 
complete cds . 



Euteleostomi ; 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



sig_peptide 
mat_peptide 



BASE COUNT 
ORIGIN 



Purchio, A. F . 

Cloning and sequence analysis of simian transforming growth 

factor-beta cDNA 

DNA 6 (3) , 239-244 (1987) 

87246074 

3474130 

Original source text: African green monkey cells (cell line 
BSC-40) , cDNA to mRNA, clone pTGF-beta-2. 

Location/Qual if iers 

1. .1561 

/organism="Cercopithecus aethiops" 
/mol__type="mRNA" 
/db_xref ="taxon: 9534" 
262. .1434 

/note=" transforming growth factor-beta precursor" 
/codon_start=l 
/protein_id="AAA35369 . 1" 
/db_xref="GI : 176553" 

/ 1 rans 1 a t i on- " MP PSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTI DMELVKRK 
RIETIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSKDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 
262. .1095 

/note=" transforming growth factor-beta signal peptide" 
1096. .1431 

/product =" trans forming growth factor-beta" 
301 a 547 c 446 g 267 t 



Query Match 79 . 9%; 

Best Local Similarity 88.5%; 
Matches 1059; Conservative 



Score 956.6; DB 9 
Pred. No. 1.3e-165 
0; Mismatches 114 



Length 1561; 
Indels 24; 



Gaps 



1; 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

Ml 1 1 1 1 II MIIIIIIMI 1 1 1 1 1 1 Mi 1 1 1 1 I i 1 1 1 1 1 1 II 1 1 1 1 II III 

262 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 321 
61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

minimi 1 1 1 1 1 1 1 1 f 1 1 1 1 mil i f 1 1 1 r 1 1 1 1 1 1 f i 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 

322 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 381 
121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

I M I i I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM MIIIIIIIM MIMIIIIII Mill Ml 

382 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 441 
181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

1 11 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 Mill II II M II 

4 4 2 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 501 
241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 M I IMMMI MM II MIIIIIIIM M III 

5 02 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 561 



Qy 



3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 



II Illll! IMIIIIIIIMMIIIIIIII IIIIIIMIIIIIM I 1 1 1 1 Illll 

562 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 



Db 


562 


GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


621 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Illll II Mill 1 I 1 MINI IIIIIIII 1 IIIIIIII II llllll 

TATGAC^GTTCT^GC^GAGC^CACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


420 


Db 


622 


681 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


480 


Db 


682 


1 1 MMI 1 1 M MMMM MMIMI 1 II It t II 1 1 II 1 1 MINIMUM 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


741 


Qy 
Db 


481 
742 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

1 1 M 1 1 II 1 1 1 1 1 1 M 1 1 1 1 IIIIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 Illlllllll 

AAGTTAAAAGTGGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


540 
801 


Qy 
Db 


541 
802 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

MIIIIIIMIIIIIMIIIIM Mill MM lllllllll MM lllllllll 

TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 


600 
861 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


660 


Db 


862 


IIIIMMMMIIMIMM Ml MM Ml M M Mill MMMM II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


921 


Qy 
Db 


661 
922 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

Illllll MM IMIIIIMMMMIIMM II Mill II MMMMM 1 

GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGA 


720 
981 


Qy 
Db 


721 
982 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 MMMM IIMIIIIIIIIII Illll MIMMIMMII MMMM III 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


780 
1041 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 


840 


Db 


1042 


MIMIMMI MMMIMIMM II MMI II 1 II M M 1 M II 1 1 M M M M 

ATGGCCACCCCACTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 


1101 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

II MMIIIMIIIMIIIMIIII Illlllllll 

GACACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1102 


1137 


Qy 

Db 


901 
1138 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

IMMIMMMIMM II MIMIMMI MMMM IIMIIIIIIIMI II 

TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 
1197 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

1 1 M M 1 II M II II M M M 1 M M M 1 M 1 II 1 II M II II II M 1 MMMM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 


1020 


Db 


1198 


1257 


Db 


1258 


GA(^CTC^GTA(^GC^GGTCCTGGCTCTGTA(^CC^GCACAACCCGGGCGCGTCGGCG 

Illll I f 1 i 1 1 f 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMM llllll 

GACACGC^GTACAGCAAGGTCCTGGCCCTGTAO^CCAGCATAACCCGGGCGCCTCGGCG 


1080 
1317 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 


1140 


Db 


1318 


IIIMIIIIIIIMIIIIIIMIIIMIIIIIIIIIIIIIIMIIIIIIIMIIMIMI 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 


1377 


Qy 


1141 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

MIMIMIIIIMMIMIIIIIIMIIIMMIIM MMIMI lllllllll 



Db 13 78 AAGCCOVAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1434 



RESULT 9 
BC022242 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC022242 1746 bp mRNA linear PRI 04-FEB-2002 

Homo sapiens, clone MGC:22008 IMAGE : 4399762 , mRNA, complete cds . 
BC022242 

BC022242.1 GI:18490115 
MGC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini ; Hominidae; Homo. 

1 (bases 1 to 1746) 

Strausberg,R. 

Direct Submission 

Submitted ( 01 -FEB-2002 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 

Contact : MGC help desk 

Ema i 1 : cgapbs - r@ma i 1 . nih . gov 

Tissue Procurement: ATCC 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgc.stanford.edu 

Contact: (Dickson, Mark) mcd@paxil.stanford.edu 

Dickson, M . , Schmutz, J., Grimwood, J., Rodriquez, A., and Myers, 
R. M. 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAK Plate: 27 Row: e Column: 21 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

FEATURES Location/Qualifiers 
source 1. .1746 

/organism= " Homo sapiens " 

/ mo 1 t yp e = " mRNA " 

/db_xref ="taxon: 9606" 

/clone= "MGC: 22 008 IMAGE : 43 9 9762 » 

/tissue_type=" Duodenum, adenocarcinoma" 

/clone J. ib= "NIH_MGC_8 8 " 

/lab_host="DH10B" 

/not e=" Vector : pCMV-SP0RT6" 
CDS 370. .1542 

/codon_start=l 

/product =" Unknown (protein for MGC:22008)" 
/protein_id="AAH22242 . 1" 
/db_xref ="GI : 18490116" 

/translation="M P PSGLRLLLLLLPLLWLLVLTPGR PAAGLSTCKT I DMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 



Y Y AKEVTR VLM VETHN E I YDKF KQSTHS I YMF FNTS ELREAVP E P VLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLY I DFRKDLGWKW I HEPKGYHANFCLGPCPYI WSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS " 

BASE COUNT 376 a 612 C 472 g 286 t 

ORIGIN 

Query Match 79.9%; Score 956.6; DB 9; Length 1746; 

Best Local Similarity 88.5%; Pred. No. 1.3e-165; 



Matches 


1059; Conservative 0; Mismatches 114; Indels 24; Gaps 


1; 


Qy 


1 


ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

III MM II IMMMMM III IMIMIII MMMMMIMI II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 


60 


Db 


370 


429 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Db 


430 


f 1 [ 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 

CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


489 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 


180 


Db 


490 


MMMIMIIIMMMMIMMI MMIMMM MIMMMM Mill Ml 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


549 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 f 1 Mill M II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


240 


Db 


550 


609 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 I 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 I 1 1 1 1 1 1 I I 1 1 III 

II II M 1 1 1 1 II 1 M II II 1 M M M II M MM M M M M 1 M M M M Ml 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


300 


Db 


610 


669 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM Mill 
GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


670 


729 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II MIMI 1 1 MMM II M M II 1 MMMM 1 1 1 1 II 1 1 

TATGACAAGTTCAAGCAGAGTACACAC^ 


420 


•nv'U 
UD 


730 


789 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II MMMM II Mill II MMMM IMMMMMMI 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


790 


849 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II IIIIIMIII 

AAGTTAAAAGTGGAG CAG CACGTGGAG CTGTACCAGAAATA CAG CAACAATTCCTGG CGA 


540 


Db 


850 


909 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


600 


Db 


910 


IIMMMIMIMMIMIMI Mill IMM II MMM 1 M IMIMIII 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


969 


Qy 


601 


ACCGGAGTTGTG CGG CAGTGG CTG A CC CG CAGAG AGG CTATAGAGGGTTTT CG CCT CAGT 

1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 1 II Ml Ml III II M Mill MMMM II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


660 


Db 


970 


1029 


Qy 


661 


GCCCACTCTTCCTCTGAC^GCAAAGATAAC^C^CTCCACGTGGAAATTAACGGGTTCAAT 


720 



Db 


1030 


Qy 


721 


Db 


1090 


Qy 


781 


Db 


1150 


Qy 


841 


Db 


1210 


Qy 


901 


Db 


1246 


Qy 


961 


Db 


1306 


Qy 


1021 


Db 


1366 


Qy 


1081 


Db 


1426 


Qy 


1141 


Db 


1486 



i"| MM MMMM MMMMMI M MMI M MMMMM I 

GCCCACTGCTCCTGTGACAGCAGGGATAACAC^CTGC^AGTGGACATCAACGGGTTCACT 108 9 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I MMMM MMMMMMMMMM MMMMMMM MMMM Ml 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 114 9 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

MMMMMMMMIMMMMMM MMI M II I II I II I M II I II II I II I 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 12 09 
GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

II MMMM MMMMMMM MMMMM 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 124 5 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

I M M M 1 , 1 1 1 1 1 1 1 1 MMMMMMM llllllll MMMMMMM II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1305 
GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

M IIIIIIIIIIIIIMIIII MMMM llllllll MMMM MMMM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 13 65 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

Mill MMMMMMMMMM 1 1 II 1 1 1 1 1 1 1 1 1 1 MMMMMI MUM 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 142 5 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

M M U 1 1 M II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 IMMIMMMIMMIIIIMIMI 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 14 8 5 
AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 1 1 1 1 i 1 1 1 1 f 1 1 1 1 1 r 1 1 1 r f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MiiiiiMiMiiiMi 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1542 



RESULT 10 

BC000125 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC000125 1780 bp mRNA linear PRI 12-JUL-2001 

Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:3119 IMAGE : 33 51664 , mRNA, complete cds . 
BC000125 

BC000125 .1 GI : 12 65274 8 
MGC. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 1780) 
Strausberg, R. 
Direct Submission 

Submitted ( 03 -NOV-2000 ) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact: MGC help desk 



Craniata ; Vertebrata; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc . be . ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski , Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thome, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra . 



FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 
Series: IRAL Plate: 6 Row: e Column: 11 

This clone was selected for full length sequencing because it 
passed the following selection criteria: Hexamer frequency ORF 
analysis . 

Loca t ion/ Qua 1 i f i er s 
1. .1780 

/organism="Homo sapiens" 

/mo l__t ype = " mRNA " 

/db_xref="taxon: 9606" 

/clone="MGC:3119 IMAGE : 3351664 " 

/tissue_type="Eye, retinoblastoma" 

/clone_lib="NIH_MGC_16" 

/lab_host="DH10B-R" 

/note= "Vector: pOTB7" 

447. .1619 

/codon_start=l 

/product="Similar to transforming growth factor, beta 1" 
/protein_id="AAH00125 .1" 
/db_xref="GI : 12652749" 

/ trans lat ion="MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 
341 a 648 c 493 g 298 t 



Query Match 79.9%; Score 956.6; DB 9 

Best Local Similarity 88.5%; Pred. No. 1.3e-165 
Matches 1059; Conservative 0; Mismatches 114 



Length 1780; 
Indels 24; 



Gaps 



1; 



Qy 

Db 



1 ATGG CG C CTT CGGGG CTG CGG CTCTTG C CGCTG CTG CTG C CG CTG CTGTGGCTG CTAGTG 6 0 

III 1 1 1 1 II lllllllllll III MINIMI MMMIMIMM 1 1 III 

4 4 7 ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 506 



Qy 



61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

M M M I III 1 1 II I M II 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMMMI 



Db 5 07 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 566 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

IIIIIIIIIIIIIIMIMIIMIM IMMIIIIM lllllllllll Mill III 

Db 567 GTGAAGCGGAAG CG CATCGAGG C CAT C CG CGG CCAGATC CTGTCCAAG CTG CGG CT CG CC 62 6 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

IIIMIIIIMIMIIIMI IMIMIIIMIIIIIMII Mill M II II II 

Db 627 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 686 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

MMMM MMMIIMIMI MMIMI MM IMMMMMMMM III 

Db 687 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 74 6 

Qy 3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 60 

II II M M II M M M M M M M M M M II M 1 1 M II M M I I MM Mill 

Db 74 7 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 8 06 

Qy 3 61 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

IMM M MIMI I I MMII MMMM I MMMM II MUM 

Db 8 07 TATGACAAGTTC^GCAGAGTACAC^C^GCATATATATGTTCTTC^CAC^TCAGAGCTC 866 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 80 

II MMMM M Mill II MMMM IMMIIMMMI MIMIMMII 

Db 867 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 92 6 

Qy 4 81 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

IMMMMMMMMMMMMMM MMMIMMMMM MMIMMI 

Db 927 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 986 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

MIMMMMMMMIMMI Mill Mill M 11 1 1 1 1 I M 1 1 1 II M 1 1 

Db 987 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 104 6 

Qy 6 01 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

II I M I II II 1 1 II 1 1 1 II 1 1 Ml Ml III II II Mill MMMM II 

Db 104 7 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1106 

Qy 661 GCCC^CTCTTCCTCTGAC^GCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

MMMI MM MMMM II I II M 1 1 II M IMM II Illlllllll I 

Db 1107 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1166 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I MMMM MM MIMUM 1 1 M 1 1 1 IIIIIIIIIIIIII MMMM III 

Db 1167 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 122 6 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

I I M II I I I I I M I I II I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I | I I I I I I I I I 

Db 1227 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 12 86 

Qy 841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

II MMMM IIIIIIIIIIIIII MMIMMI 

Db 12 87 GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1322 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMIIMMMI IIIIIIII IIIIIIIIIIIIII M 

Db 1323 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 13 82 



Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

II llllllllllllllllllll llllllll llllllll llllllll llllllll 
GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 


1020 


Db 


1383 


1442 


Qy 


1021 


GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 

him iiiiiiiiiiiiiiiiiiii iiiiiiiiiiiiii iiiiiiiiiii mini 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 


1080 


Db 


1443 


1502 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 

Mill II 1 TIN 'MM II 1 IIIIIIIIIIIIIIIIMIIIIillll 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 


1140 


Db 


1503 


1562 


Qy 
Db 


1141 
1563 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

M 1 M 1 1 II Ml 1 Ml 1 1 1 1 1 1 1 1 M 1 1 II MM! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 

AAGGCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1619 



RESULT 11 

BC001180 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



REMARK 
COMMENT 



BC001180 1780 bp mRNA linear PRI 12-JUL-2001 

Homo sapiens, Similar to transforming growth factor, beta 1, clone 
MGC:2323 IMAGE: 3356605, mRNA, complete cds . 
BC001180 

BC00118 0 . 1 GI : 12654682 
MGC . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1780) 
Strausberg, R . 
Direct Submission 

Submitted ( ll-DEC-2000) National Institutes of Health, Mammalian 
Gene Collection (MGC) , Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive, Room 11A03, Bethesda, MD 20892-2590, 
USA 

NIH-MGC Project URL: http://mgc.nci.nih.gov 
Contact : MGC help desk 
Email : cgapbs-r@mail .nih.gov 
Tissue Procurement: ATCC 

cDNA Library Preparation: Rubin Laboratory 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Genome Sequence Centre, 

BC Cancer Agency, Vancouver, BC, Canada 

inf o@bcgsc . be . ca 

Steven Jones, Jennifer Asano, Ian Bosdet, Yaron Butterfield, 
Susanna Chan, Readman Chiu, Chris Fjell, Erin Garland, Ran Guin, 
Letticia Hsiao, Martin Krzywinski , Reta Kutsche, Oliver Lee, Soo 
Sen Lee, Victor Ling, Carrie Mathewson, Candice McLeavy, Steven 
Ness, Pawan Pandoh, Anna-Liisa Prabhu, Parvaneh Saeedi, Jacqueline 
Schein, Duane Smailus, Michael Smith, Lorraine Spence, Jeff Stott, 
Michael Thorne, Miranada Tsai, Natasja van den Bosch, Jill Vardy, 
George Yang, Scott Zuyderduyn, Marco Marra . 



Clone distribution: MGC clone distribution information can be found 
through the I.M.A.G.E. Consort ium/LLNL at: http://image.llnl.gov 



Series: IRAL Plate: 6 Row: e Column: 10 

This clone was selected for full length sequencing because it 
passed the following selection criteria: matched mRNA gi : 37097. 
FEATURES Loca t ion/Qual i f lers 

source 1. .1780 

/organism="Homo sapiens" 
/ mol_type= " mRNA " 
/db_xref ="taxon: 9606" 
/clone="MGC:2323 IMAGE : 3356605 " 
/ 1 is sue_type= " Eye , ret inoblas toma " 
/clone_lib="NIH_MGC_16" 
/lab_host="DH10B-R" 
/not e=" Vector: pOTB7" 
CDS 447. .1619 

/codon_start=l 

/product="Similar to transforming growth factor, beta 1" 
/protein_id="AAH01180 .1" 
/db_xref = "GI : 12654683" 

/ translat ion="MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSPCLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKE VTR VLM VETHN E I YDKFKQSTHS I YMFFNTS ELREAVPE P VLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 

BASE COUNT 341 a 648 c 493 g 298 t 

ORIGIN 

Query Match 79.9%; Score 956.6; DB 9; Length 1780; 

Best Local Similarity 88.5%; Pred. No. 1.3e-165; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

Ml MM M MMMMMI Ml MMMMI MMMMMMM M Ml 

Db 4 47 ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 506 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill I f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 5 07 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 566 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 MMMMMI MMMMMI Mill III 

Db 5 67 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 626 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II II II II 

Db 627 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 68 6 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

IMIIIII MMMMMMM I MINI I MM 1 1 II 1 1 1 1 M M M 1 1 1 III 

Db 687 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 74 6 

Qy 3 01 GCGGACTACTACGCOUVGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 60 

II 1 1 1 M I M II I M 1 1 II M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 II I I MM Mill 

Db 74 7 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 8 06 



Qy 



3 61 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 



Mill II MINI I I llllll llllllll I llllllll II MINI 

Db 8 07 TATGACAA.GTTCA&GCAGAGTACA 8 66 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 80 

II MINIM 1 1 Mill II IMMMI MMMMMMM 1 1 1 1 1 1 1 1 1 1 1 1 

Db 8 67 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 926 

Qy 4 81 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

IMIIMMMMMMMMMIMMMMMMMMMMM IMMMMI 

Db 927 AAGTTAAAAGTGGAG CAG CACGTGGAG CTGTAC CAGAAATACAG CAACAATT CCTGGCGA 986 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 6 00 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill M llllll I II IMMMM 

Db 987 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1046 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGGTATAGAGGGTTTTCGCCTCAGT 660 

1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml III III II II Mill llllllll II 

Db 104 7 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1106 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

MIMM MM llllllll 1 1 1 1 1 1 1 1 1 1 1 II Mill II IMMMMI I 

Db 1107 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1166 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I IMMMI 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 MMMMMMM IMMMI III 

Db 1167 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1226 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

1 1 E [ 1 1 [ 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill IMMMMMMIMMMMM 

Db 122 7 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 128 6 

Qy 841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

II IMMMI llllllllllllll MINIUM 

Db 128 7 GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1322 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMMM llllllll llllllllllllll II 

Db 1323 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1382 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II 1 1 1 II : 1, 1 1 1 II 1 1 1 1 1 1 ! IMMMI MINIM IMMMI llllllll 

Db 13 83 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1442 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 1080 

Mill I , I 1 1 1 1 1 1 1 1 ; 1 . 1 1 1 1 I M I II M II II II IINNNIN MINI 

Db 14 4 3 GAC^CGCAGTAC^GC^GGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1502 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r r mnnmninnimmmmin 

Db 15 03 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1562 

Qy 1141 AAGCCC^GGTGGAGCAGCTGTCO^CATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

M M M i I M M II IMIIMM Ml M Ml 1 1 ll 1 1 1 1 1 1 1 1 1 1 II II II II 1 1 1 

Db 1563 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1619 



RESULT 



12 



E03028 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

COMMENT 



E03028 1821 bp RNA linear PAT 29-SEP-1997 

DNA encoding human prepro TGF-betal . 

E03028 

E03028 . 1 GI : 2171250 
JP 1991180192-A/l. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1821) 

Ohashi,H., Ishii,Y., Miyata,Y., Miyazono,K., Miyagawa,K. and 
Takaku,F. 

PRODUCTION OF HUMAN PRO-TGF-BETAl BY GENETIC RECOMBINATION 
Patent: JP 1991180192-A 1 06-AUG-1991; 
KIRIN BREWERY CO LTD 
OS Homo sapiens (human) 
PN JP 1991180192-A/l 
PD 06-AUG-1991 
PF 07-DEC-1989 JP 1989318243 
PI OHASHI HIDEYA, ISHII YASUYUKI , 
KOHEI , 



MIYATA YOSHINORI, PI MIYAZONO 



PI 
PC 
PC 
PC 
CC 
CC 
CC 
CC 
CC 
CC 
FH 
FH 
FT 
FT 
FT 

sig_peptide 
FT 
FT 
FT 
FT 
FT 



MIYAGAWA KIYOSHI, TAKAKU FUMIMARO 

C12P21/00, C07K13/00, C12N5/10 , C12N15/18//C12Q1/68 , (C12P21/00, 
C12R1 : 91) , 

(C12N15/18,C12R1: 91) ; 
strandedness : Double; 
topology: Linear; 
hypothetical: No; 
ant i -sense: No; 

♦source: t issue_type=placenta ; 
clone=pVC19-TGF; 

Location/Quali f iers 



♦source 
Key 



5 'UTR 
CDS 



mat_peptide 
mat_peptide 



1. .510 
511. .1683 

/product = 'human prepro TGF-betal' 
511. .597 

598. .1680 

/note=' human pro TGF-betal' 
1345. .1680 

/product = 'human TGF-betal' 
1684. .1820. 



FT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



3 'UTR 

Location/Qualifiers 
1. .1821 

/organism="Homo sapiens" 
/mol_type= "genomic RNA" 
/db_xref ="taxon: 9606" 
326 a 679 c 508 g 308 



Query Match 79 . 9%; 

Best Local Similarity 88.5%; 
Matches 1059; Conservative 



Score 956.6; DB 6 
Pred. No. 1.3e-165 
0; Mismatches 114 



Length 1821; 
Indels 24; Gaps 



1; 



QY 



ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

III 1 1 1 1 II 'IIMIIII MMMMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 



60 



Db 


512 


Qy 


61 


Db 


572 


Qy 


121 


Db 


632 


Qy 


181 


Db 


692 


Qy 


241 


Db 


752 


Qy 


301 


Db 


812 


Qy 


361 


Db 


872 


Qy 


421 


Db 


932 


Qy 


481 


Db 


992 


Qy 


541 


Db 


1052 


Qy 


601 


Db 


1112 


Qy 


661 


Db 


1172 


Qy 


721 


UD 


1Z jz 


Qy 


781 


Db 


1292 


Qy 


841 


Db 


1352 



512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

iiiiiiiiiii i iiiiii inn iiiiiiiiiiiiii 1 1 : 1 1 .1 ii i ii 1 1 1 

CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 

GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

1 1 1 1 1 M 1 1 1 1. 1 II Ih 1 1 1 1 1 M lllllllllll lllllllllll Mill III 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 691 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

IIIIIIIIMIIIIIIIIII I M I M I II I M 1 1 1 1 Ml Mill II M II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 751 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

llllllll lillllllllllll llllllll MM Ml 1 1 1 1 1 M M II 1 1 III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 811 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II 1 1 1 1 1 1 1 1 : 1 1 11 1 1 1 1 1 1 1 1 1 1 M 1 1 1 M I i I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGGGTGCTAATGGTGGAAACCCACAACGAAATC 871 

TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

Mill II IIIIII I I IIIIII llllllll I llllllll II IIIIII 

TATGACAAGTTCAAGC^GAGTAC^CACAGCATATATATGTTCTTCAACACATCAGAGCTC 931 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

II llllllll II Mill II llllllll IIIIIIIIIIIIII I I I I I I | I I I I | 
CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 991 

AAGTTAAAAGTGGAGCAGCACGTGGAGGTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

IIMIMIMIIMMMIMMIMM MMMMMMMMI Illlllllll 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 1051 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

I I I I I I I I I I I I I I I I M I I I II Mill Mill II IIIIII I II Mill 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1111 

AC CGGAGTTGTG CGG CAGTGG CTGACGCG CAGAGAGG CTATAGAGGGTTTTCG C CTCAGT 660 

M II I I I II I I I I I II I I I I I III III III II II Mill llllllll II 

AC CGGAGTTGTG CGG CAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1171 

GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCC^ 72 0 

I I I M 1 1 MM IIIIMM Illlllllll II Mill II Illlllllll I 

GCCCACTGCTCCTGTGAC^GC^GGGATAACACACTGC^GTGGACATC^ACGGGTTCACT 1231 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I iiiiiiii 'Minimum mimmimmm imiimi mi 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 12 91 

ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 Mill MIMIIMIMIIMIMIMII 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1351 

GATACC^CTACTGCTTa\GCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

M MMMM M I I I II I I I I I I | Illlllllll 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 13 8 7 



Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 96 0 

IIIIIIIIIIIIIIMI llllllllllllll llllllll I I I I I I I I I I I I I ' II 

Db 1388 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1447 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II lllllllllllllllllll llllllll llllllll llllllll llllllll 

Db 144 8 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1507 

Qy 1021 GACACTCAGTACAGC^GGTCCTGGCTCTGTAC^CC^G(^C^ACCCGGGCGCGTCGGCG 1080 

Mill I I M I T I I I I I I I I I I I I llllllllllllll lllllllllll MMII 

Db 15 08 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1567 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

I M I I I I I II I M I I I I I I I II I II I I I:' II I I I I I I I II I I II I II Mi I I II I I I I I 

Db 1568 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1627 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIMMIIII I I I I I I I I ! I I I I I I 
Db 1628 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1684 



RESULT 13 

A06669 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



CDS 



BASE COUNT 
ORIGIN 



A06669 2537 bp mRNA linear PAT 29-JUL-1993 

Synthetic mRNA for preTGF-Betal . 

A06669 

A06669 . 1 GI :412940 

synthetic construct 
synthetic construct 
artificial sequences. 
1 (bases 1 to 2537) 

NUCLEIC ACID ENCODING TGF- beta 3 AND ITS USE 
Patent: WO 8912101-A 4 14-DEC-1989; 

Location/Qualifiers 

1. .2537 

/organism="synthetic construct" 

/mol_type="mRNA" 

/db_xref="taxon: 32630" 

842. .2014 

/codon_start=l 

/ trans l_table= 11 

/product="preTGF-betal" 

/protein_id="CAA00588 . 1" 

/db_xref="GI : 4 12 941" 

/ translat ion= "MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEI YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDS PEWLSFDVTGWRQWLSRGGE I EGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDT 
QYS KVLAL YNQHN PGASAAPCC VPQALE PLP I VYY VGRKPKVEQLSNM I VRSCKCS " 
473 a 893 c 739 g 432 t 



Query Match 79.9%; Score 956.6; DB 6; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 1.2e-165; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 



Qy 

Db 


i 

842 


ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

III MM II 1 1 1 1 1 1 1 1 1 1 1 1 MINIUM! 1 1 1 II 1 1 1 1 1 1 1 1 1 II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 


60 
901 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Db 


902 


MMMIMMM MIMMM Mill MMMMMMM MMMIMMMM 

CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


961 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

MIMMM MIMIIMIMM MIMMM MINIUM Mill III 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


180 


Db 


962 


1021 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

MMMMMMMMI M M M 1 M 1 M 1 1 II II 1 1 1 1 Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


240 


Db 


1022 


1081 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

llllllll MMMMMMM llllllll MM 1 M 1 1 II 1 II 1 1 1 1 1 II III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


300 


Db 


1082 


1141 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II M M M MM MM M MM Ml M M M M M M M M M 1 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


1142 


1201 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II llllll 1 1 llllll MINIM 1 llllllll II llllll 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


420 


Db 


1202 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II llllllll N Mill II llllllll MMMMMMM 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


1262 


1321 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

M MM II 1 Mill III IMMMIMM 1 Ml MMMM MM 1 IINIIMN 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


540 


Db 


1322 


1381 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

MMMMMMMIMM M INN INN II llllll 1 II MMMM 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


600 


Db 


1382 


1441 


wy 




a r , pppz\rjTTnTf^pr^ppAPTniPPT , p.A pppppziPZ\pzipppT > fiT , &pzipppT'T ,r r r rrv r " , r , PTPAP r r 

1 1 1 1 M 1 1 1 1 Ml MM 1 1 M III III III II II INN llllllll II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


bbU 


Db 


1442 


1501 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

MM INI MMMM MIMMM II Mill II MMMM 1 1 

GCCC^CTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


720 


Db 


1502 


1561 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 llllllll MMMMIMMMIMM MMMMMMM llllllll III 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


780 


Db 


1562 


1621 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 


840 



II Mill II I M III I I 

Db 1622 ATGGC(^CCCCGCTGGAGAGGGCC(^G(^TCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1681 

Qy 841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 9 00 

II MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMM 

Db 1682 GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1717 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM MINIM 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

Mill II II Mill Mill!! MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IM II IIMM 

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

M II J 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 f 1 1 1 ! 1 1 1 1 1 1 1 

Db 18 98 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 1 1 1 m i : 1 1 : 1 I I I I I I I I I I I 111 III II I I I II I i| I I I M I I 

Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2 014 



RESULT 14 

E00973 

LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



E00973 2527 bp RNA linear PAT 29-SEP-1997 

cDNA encoding human TGF-beta . 

E00973 

E00973.1 GI:2169234 
JP 1986219395-A/l. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 2527) 
Riku, M . A . D . D . and Debitsudo, B . G . 
NUCLEIC ACID CODED WITH TGF-BETA AND ITS USE 
Patent: JP 1986219395-A 1 29-SEP-1986; 
GENENTECH INC 
OS human 

PN JP 1986219395-A/l 

PD 29-SEP-1986 

PF 20-MAR-1986 JP 1986064661 

PR 22-MAR-1985 US 85 715142 

PI RIKU MAIKERU ANDORE DERINKU, DEBITSUDO BANNOOMAN GETSUDERU PC 

C12P21/00,C12Nl/00 / C12N5/00 / C12N15/00//C12Ql/68, (C12P21/00, PC 
C12R1: 91) , 

PC (C12N1/00,C12R1 : 19) , (C12N5/00 , C12R1 : 91 ) , (C12N15/00, C12R1: 91) ; 
CC strandedness : Double; 
CC topology: Linear ; 
CC hypothetical: No; 
CC ant l- sense: No; 





♦source: tissue^ 


_type=placenta 


ana giyoDias coma ; 




Location/Qualifiers 




DU 

r n 








FT 


CDS 


842. .2014 




FT 




/product = 'pre 


TGF-beta 1 


FT 


mat_pept ide 


1676. .2011 




FT 




/product= 1 TGF- 


-beta ' 


FT 


5 'UTR 


1. .841 




FT 


3 'UTR 


2015. .2537 




FT 


stem_loop 


37. .113 




FT 


stem_loop 


2015. .2100 




FT 


polyA_site 


2514. .2519. 





FH 



Key 



FEATURES 

source 



BASE COUNT 
ORIGIN 



472 



Location/Qualifiers 
1. .2527 

/organism="Homo sapiens" 
/mol_type="genomic RNA" 
/db_xref ="taxon: 9606" 
a 888 c 735 g 432 



Query Match 79.8%; 
Best Local Similarity 88.4%; 
Matches 1058; Conservative 



Score 955; DB 6; Length 2 527; 
Pred. No. 2.4e-165; 
0; Mismatches 115; Indels 24; 



Gaps 



1; 



Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

ill 1 1 M II iliiiiiiiii :i 1 1! 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f f 1 1 j ii ill 

84 2 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 



61 



902 



121 



962 



181 



CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 

1 1 [ 1 1 1 1 [ 1 1 E 1 1 MINIM Mill llllllllllll II Mil 

CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 



60 



901 



120 



961 



180 



GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

IIIIIIMIIIIIIIMIMIIMII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 Mill III 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 



240 



AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

IIIIIIIIMIIIIIMIII III Mill II II II II 

1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTC 1081 



241 



300 



TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

MINIM 1 N 1 1 1 III 1 1 1 II I F 1 1 1 1 1 1 MM "I III 1 1 IN 1 1 1 1 1 1 III 

1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 



Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II 1 M 1 II 1 1 1 IN Nil II II 1 II 1 N 1 1 N 1 II 1 III II II 1 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


1142 


1201 


Qy 


361 


TATGATAAATTC^GGGC^CCCCC(^(^GCTTATATATGCTGTTC^CACGTCGGAGCTC 

Mill II Nil 1 I 1 II II 1 llllllll 1 llllllll II INI 

TATGACAAGTTCAAGCAGAGTACACA.CAGCATAT^ 


420 


Db 


1202 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II llllllll II Mill II llllllll 1 ill 1 1 llllllllllll 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


1262 


1321 



Qy 



481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 



Db 


1322 


Qy 


541 


Db 


1382 


Qy 


601 


Db 


1442 


Qy 


661 


Db 


1502 


Qy 


721 


Db 


1562 


Qy 


781 


Db 


1622 


Qy 


841 


Db 


1682 


Qy 


901 


Db 


1718 


Qy 


961 


Db 


1778 


Qy 


1021 


Db 


1838 


Qy 


1081 


Db 


1898 


Qy 


1141 


Db 


1958 



j 1 1 1 1 1 ; II ; 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 !i M III MINI III 1 1 IMIIIIIII 

AAGTTAAAAGTGGAGCAGC^CGTGGAGCTGTACC^GAAATA^GCAAO^ATTCCTGGCGA 1381 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

1 1 ! 1 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 ; Mill Mill II MUM I M IIIM III 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 III Ml Ml II II IIIM Mllllll II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501 
GCCCACTCTTCCTCTGACAGOWVGATAACAC^CTCC^CGTGGAAATTAACGGGTTCAAT 720 

1 1 1 1 1 1 1 MM Mllllll I MM M IIIM II IMIIIIIII I 

GCCCACTGCTCCTGTGACAGC^GGGATAAC^CACTGC^AGTGGACATCAACGGGTTCACT 1561 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 Mllllll III 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1681 
GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

ii 1 1 1 1 1 1 1 1 iiiMMiiiiiii Milium 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1717 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

I M I I I I II I II I II I I llllllllllllll Mllllll llllllllllllll II 
TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II IIMMMIIIIIIIIIIII Mllllll Mllllll Mllllll Mllllll 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

iiiii nil i i i iiiiiiiiiiiiii iiiiiiiiiii mill 

GAC^CGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 18 97 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

I I I M 1 1 1 1 : 1 1 1 1 1 1 1 ( 1 1 1 1 1 1 i 1 1 1 1 1 1 II 1 1 M II M 1 1 1 1 1 1 II I M I II Ml 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 
AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

I M 1 1 1 1 1 1 1 1 II M 1 1 1 M M 1 1 1 1 II I IM M I M 1 1 1 1 II 1 1 II 1 1 1 1 1 II I 

AAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2 014 



RESULT 15 

BT007245 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 



BT007245 1173 bp mRNA linear PRI 13-MAY-2003 

Homo sapiens transforming growth factor, beta 1 (Camurati -Engelmann 
disease) mRNA, complete cds . 
BT007245 

BT0 0724 5. 1 GI : 3 058332 8 
FLI_CDNA . 

Homo sapiens (human) 



ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 {bases 1 to 1173) 

AUTHORS Kalnine,N., Chen,X., Rolfs, A., Halleck,A., Hines,L. , Eisenstein , S . , 

Koundinya,M. , Raphael, J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y., 

Phelan,M. and Farmer, A. 
TITLE Cloning of human full-length CDSs in BD Creator (TM) System Donor 

vector 
JOURNAL Unpubl i shed 
REFERENCE 2 (bases 1 to 1173) 

AUTHORS Kalnine,N., Chen,X., Rolfs, A., Halleck,A. , Hines,L., Eisenstein , S . , 

Koundinya , M. , Raphael, J., Moreira,D., Kelley,T., LaBaer,J., Lin,Y., 

Phelan,M. and Farmer, A. 
TITLE Direct Submission 

JOURNAL Submitted ( 13 -MAY-2003 ) BD Biosciences Clontech, 1020 East Meadow 
Circle, Palo Alto, CA 94303, USA 
COMMENT This CDS clone is a part of a collection of human full length 

expression clones generated by BD Biosciences Clontech and the 
Harvard Institute of Proteomics . Each CDS has been cloned in two 
forms: with and without stop-codon (to allow fusion with C-terminal 
tag). The CDS has been directionally cloned using BD In-Fusion (TM) 
cloning system between the Sail and Hindi I I sites of the pDNR-DUAL 
vector. Additional sequences in the clone: 'ACC after Sail site 
and before ' ATG 1 to provide Kozak consensus sequence; 'GG' after 
last codon and before Hindi I I site to maintain reading frame. 
Clone distribution : http : //bioinfo . clontech . com/orf clones . 
FEATURES Location/Qualifiers 
source 1. .1173 

/organism= "Homo sapiens " 
/ mo 1 _t yp e = " mRNA " 
/db_xref= M taxon: 96 06" 
/clone="GH00247Xl. 0" 

/clone_lib="BD Creator (TM) CDS Library derived from MGC 
collection" 

/lab_host="DH5alpha Tl resistant" 
/note= "Vector : pDNR-Dual " 
CDS 1. .1173 

/codon_start=l 

/product =" trans forming growth factor, beta 1 
(Camurati-Engelmann disease) " 
/protein_id="AAP35909 .1" 
/db__xref="GI : 30583329" 

/ trans la t ion = "MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRK 
RIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEAD 
YYAKEVTRVLMVETHNEIYDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSNRLLAPSDS PEWLSFD VTGVVRQWLSRGGE I EGFR 
LSAHCSCDSRDNTLQVDINGFTTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRH 
RRALDTNYCFSSTEKNCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDT 
QYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS" 

BASE COUNT 238 a 380 c 352 g 203 t 

ORIGIN 



Query Match 79.7%; Score 954.6; DB 9 

Best Local Similarity 88.5%; Pred. No. 3.1e-165 
Matches 1057; Conservative 0; Mismatches 114 



Length 1173; 

Indels 24; Gaps 1; 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 


Qy 


541 


Db 


541 


Qy 


601 


Db 


601 


Qy 


661 


UD 


DDI 


Qy 


721 


Db 


721 


Qy 


781 


Db 


781 



ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III 1 1 1 1 II III IM Ml MINIMI MIIMII Mill I II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 6 0 
CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

1 1 ! 1 1 : 1 1 1 1 1 1 1 II 1 1 M II I Mill llllllllllllll 1 1 1 1 1 1 1 1 :i II IN 

CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 12 0 
121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 
121 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 18 0 

AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

llllllllllllllllllll llllllllllllllllllll Mill II II II II 
AGCCGCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 24 0 

TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

llllllll llllllllllllll MIIMII MM II M II 1 1 1 1 1 1 1 1 M III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 30 0 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II I M 1 1 1 1 M II 1 1 1 1 II 1 1 1 1 i 1 1 M I II I II I II ! I II : I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 360 



II II llllll I I IIMM llllllll I llllllll II Mill 



CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

M llllllll II Mill II llllllll ! I I I I I I I I I I I I llllllllllll 
CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 4 8 0 

AAGTTAAAAG TGG AG CAG CA CGTGG AG CTATA CCAGAAATA CAG CAATGATT C CTGG CG C 54 0 

! I I I I I I I I I ' I I I I lllllllllllllllll llllllllll 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 54 0 

TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

I M : 1 1 : 1 1 1 Mi I II 1 1 : 1 II Mill Mill II IIMM I II llllllll 

TACCTCAGCAACCGGCTG CTGG CACC CAG CGACTCGCCAGAGTGGTTATCTTTTGATGTC 600 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

I II 1 1 1 1 1 1 1 II 1 1 II I M 1 1 Ml III III II II Mill MIIMII II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 660 
GCCC^CTCTTCCTCTGAC^GO^GATAACACACTCCACGTGGAAATTAACGGGTTC^T 72 0 

lllllll MM MIIMII MIMIMIM M Mill II llllllllll I 

GCC(^CTGCTCCTGTGACAGC^GGGATAA(^CACTGa^GTGGAC^TCAACGGGTTC^CT 72 0 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I MIIMII M IM I M II 1 1 1 1 1 1 1 1 1 1 1 1 r I f 1 1 1 1 1 1 1 1 MIIMII Ml 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 78 0 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J I J 1 1 f 1 1 1 1 Mill Ml llllllll Mill lllllll 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 84 0 



Qy 



841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 



841 



II Ml II llllllllllllll 

GACACCAACTATTGCTTCAGCTCCAC - 



MINIUM 

- GGAGAAGAAC 



876 



960 



Db 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

I I I I I I I I I II I 1 1 I I I MIIIIMIMIII M M llllllllllllll II 

Db 8 77 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 936 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM IMIIIII 1 II I'll Mill II 

Db 937 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 9 96 

Qy 1021 GAC^CTCAGTACAGCAAGGTCCTGGCTCTGTAC^CCAGCACAACCCGGGCGCGTCGGro 108 0 

Mill 1 1 1 II I II II II I M III II llllllllllllll MIMMIMI llllll 

Db 997 GAC^CGCAGTACAGC^GGTCCTGGCCCTGTAO^CCAGCATAACCCGGGCGCCTCGGCG 1056 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

IIMM III IIMIMM II II I 1 1 II M 1 1 1 MM 1 1 1 II II II I Ml I 

Db 1057 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1116 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCT 1195 

IMIIIII II MM IIIMIMI II MIIMI MM 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 

Db 1117 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCT 1171 



Search completed: October 27, 2003, 18:34:51 
Job time : 4675.52 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: October 27, 2003, 10:47:27 ; Search time 359.617 Seconds 

(without alignments) 
8985.201 Million cell updates/sec 

Title: US-10-017-372E-12 
Perfect score: 1197 

Sequence: 1 atggcgccttcggggctgcg gttcctgcaagtgcagctga 1197 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched : 



2552756 seqs , 1349719017 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
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/SIDSl/gcgdata/geneseq/geneseqn 
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/SIDSl/gcgdata/geneseq/geneseqn 
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embl / NA1 981. DAT : * 
embl/NA1982 . DAT: * 
embl / NA1 983. DAT : * 
embl/NA1984 .DAT: * 
embl / NA1 985. DAT : * 
embl/NA1986.DAT:* 
embl /NA1 987 . DAT : * 
embl/NA1988 .DAT:* 
-embl /NA198 9 . DAT : * 
-embl /NA1 990 . DAT : * 
-embl /NA1991. DAT:* 
-embl/NA1992 .DAT:* 
-embl /NA1 993 . DAT : * 
-embl /NA1 994 . DAT: * 
-embl/NA1995 .DAT: * 
-embl /NA1996 .DAT: * 
-embl /NA1997. DAT: * 
-embl/NA1998 .DAT: * 
-embl/NA1999 .DAT: * 
-embl/NA2 000 .DAT: * 
- embl /NA2 001A.DAT: * 
-embl /NA2 001B . DAT : * 
embl /NA2 0 02 .DAT: * 
embl /NA2 003 .DAT: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AAD22696 

ID AAD22696 Standard; cDNA; 1326 BP. 
XX 

AC AAD22696; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) cDNA . 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 

KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 

KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 

KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; ss. 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualifiers 

FT CDS 16.. 1188 

FT /*tag= a 

FT /product = "Porcine TGF-betal mutant protein" 
XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 20-APR-2001; 2001WO-US12980. 
XX 

PR 20-APR-2000; 2000US-199014P . 
XX 



PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR P-PSDB; AAE13596. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Claim 1; Fig 1; 78pp ; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease ( IBD) , under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 

CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell. 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is a cDNA encoding porcine TGF-betal mutant. 

XX 

SQ Sequence 1326 BP; 263 A; 438 C; 392 G; 233 T; 0 other; 

Query Match 94.9%; Score 1135.8; DB 24; Length 1326; 

Best Local Similarity 97.8%; Pred. No. 4.6e-231; 

Matches 1171; Conservative 0; Mismatches 2; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

II I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I li , 1 1 ! 1 1 1 : 1 1 1 II I II I II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 16 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 75 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

III II II I 'I II I I I M II I II' 

Db 76 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 13 5 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

M Ml 1 1 1 1 1 II 1 1 1 1 1 1 1 li I II 1 1 1 1 1 1 1 1 1 , 1 1 1 1 1 1,1 1 M I Ml 1 1 M M Ml 

Db 136 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTCGCC 195 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I ( I I I I I ' I I . I ' I I I I I I 
Db 196 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCCGTACTGGCTCTT 255 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

M III II 1 1 II I M 1 1 1 M I I II i I M II 1 1 1 1 M 1 1 M MM 1 1 M II M I II I Ml II 

Db 2 56 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 315 



Qy 


301 


Db 


316 


Qy 


361 


Db 


376 


Qy 


421 


Db 


436 


Qy 


481 


Db 


496 


Qy 


541 


Db 


556 


Qy 


601 


Db 


616 


Qy 


661 


Db 


676 


Qy 


721 


Db 


736 


Qy 


781 


Db 


796 


Qy 


841 


Db 


856 


Qy 


901 


Db 


892 


Qy 


961 


Db 


952 


Qy 


1021 


Db 


1012 


Qy 


1081 


Db 


1072 



GCGGACTACTACGCC^GGAGGT(^CCCGCGTGCTAATGGTGGAAAGCGGC^ACCAAATC 360 

1 1 1 1 1 1 1 1 1 !! I M I' 1 1 : 1' II 1 1 1 1 1 1 1 1 1 1 1 1 1 1! : 1 1 1 1 1 1 1 i I i 1 1 ! I II I II 

GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 375 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

1 1 1 1 1 1 i 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 Ml 1 1 II 1 1 M 1 1 1 II 1 1 1 1 II 1 1 1! 

TATGATAAATTCAAGGGCACCCCGCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 43 5 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I !l I I I I : I II I II I I I I I I I II I I II 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 95 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

I I I I I I I I I II I I I I I I I I I I I I I I M I I I I I I II : l I I I I I I :l I I I I I I I I I I Ml 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 555 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

1 1 1 1 1 I ! 1 1 1 I ! I 1 1 1 1 1 ! ] ! 1 1 i 1 1 1 [ I I ! 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 [ ! I ! I i 1 1 

TACCTCAGCAACCGGGTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 615 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

I II I II i II MM M M II 1 1 II 1 1 II I M II MM I IM 1 1 II I M 1 1 1 II I MM II 

ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 67 5 
GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

II II II I II II MM II I M II II I II 1 1 1 1 M MM I II M II MM 1 1 1 II I II II II 

GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 735 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 780 

MMMMMMMMMMMMMMMMMIMMMMMMMMMMMMI 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCGCTTCCTGCTCCTC 795 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

M II I M M I IM I M I II I II II II M M II II II I II I II II I II I IM I M Mill 

ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 8 55 
GATACCAACTACTGCTTCAGCTCCACGGAGTACAAGGATGACGACGACAAGGAGAAGAAC 900 

I II M II II II III II 1 1 II II II 1 1 I E 1 1 1 1 1 1 1 1 

GATACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 891 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

MMMMMMMMMMIMMMMMMMMMMMMMMMMMMMI 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 951 
GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1011 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTAC^CC^GC^CAACCCGGGCGCGTCGGCG 108 0 

MM I M M Ml 1 1 1 1 MMMII I I II I MM M DIM MM I III 1 1 MM 

GACACTC^GTACAGCAAGGTCCTGGCTCTGTAC^CCAGCACAACCCGGGCGCGTCGGCG 1071 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

1 1 1 1 1 1 1 1 M I ! 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 II I II I 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1131 



Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 ! IN 1 1' M ' 1 1; M 1 1 1 1 1 1 : l L 1 1 ! II ! I ! II 1 1 1 1 1 1 1 1 1 1 II II Ml I! 

Db 1132 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1188 

RESULT 2 
AAI58342 

ID AAI58342 standard; cDNA; 2742 BP. 
XX 

AC AAI58342; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polynucleotide SEQ ID NO 545. 
XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 

KW leukaemia; ss . 

XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2000WO-US34263 . 
XX 

PR 21-JAN-2000; 2000US- 048 8725 . 

PR 25-APR-2000; 2000US-0552317 . 

PR 09-JUL-2000; 2000US-0598042 . 

PR 19-JUL-2000; 2000US- 0620312 . 

PR 03-AUG-2000; 2000US- 0653450 . 

PR 14-SEP-2000; 2 000US- 0662 1 9 1 . 

PR 19-OCT-2000; 2000US-0693036 . 

PR 29-NOV-2000; 2000US- 0727344 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C, Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR P-PSDB; AAM39186. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Claim 1; SEQ ID NO 545; 10078pp; English. 
XX 

CC The invention relates to human nucleic acids (AAI57798 -AAI61369) and 

CC the encoded polypeptides (AAM3 8642 -AAM422 13 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 



CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Activin/inhibin activity, chemotact ic/chemokinetic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 2742 BP; 526 A; 938 C; 799 G; 479 T; 0 other; 



Query Match 80.1%; Score 958.2; DB 22; Length 2742; 

Best Local Similarity 88.6%; Pred. No. 2e-193; 

Matches 1060; Conservative 0; Mismatches 113; Indels 24; Gaps 1; 



Qy 


l 


ATGGCG C CTT CGGGG CTG CGG CT CTTGC CG CTGCTGCTGCCGCTG CTGTGG CTG CTAGTG 


60 






III 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 
Ml 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 




Db 


842 


ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 


901 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 




Db 


902 


CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


961 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 


180 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 III 
1 1 1 II 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill III 




Db 


962 


GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


1021 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 


240 






1 ; IIMIIIIIIIIIIIIIIII Mill II II II II 




Db 


1022 


AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


1081 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


300 






MINI: 1 1 Mill MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml 




Db 


1082 


TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


1141 


Qy 


301 


GCGGACTACTACGCC^GGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGO^CCAAATC 


360 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 If i 1 1 1 1 1 1 1 1 MM Mill 




Db 


1142 


GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


1201 


Qy 


361 


TATGATAAATTC^GGG^CCCCCC^CAGCTTATATATGCTGTTO^CACGTCGGAGCTC 


420 






Mill M MIMI 1 1 MUM MINIM 1 MINIM II MINI 




Db 


1202 


TATGAGAAGTTCAAGCAGAGTACACACAGCATA 


1261 


Qy 


421 


CGGGAAG CGGTG C CGGAA C CTGTATTG CT CT CT CGGG CAGAG CTG CG CCTG CTGAGG CT C 


480 






II MINIM II INN II NIMH 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM, Ml 




Db 


1262 


CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


1321 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 






i M i M m 1 1 1 1 . i ri i ii i m ii 1 1 1 1 . i ri i mm ii 1 1 1 1 1 Milium 




Db 


1322 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTAC(^GAAATACAGC^ACAATTCCTGGCGA 


1381 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


600 



MINI MINIMI III Mill Mill II MIMI I II Mil 



Db 



13 82 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 



1441 



Qy 


601 


Db 


1442 


Qy 


661 


Db 


1502 


Qy 


721 


Db 


1562 


Qy 


781 


Db 


1622 


Qy 


841 


Db 


1682 


Qy 


901 


Db 


1718 


Qy 


961 


Db 


1778 


Ov 


1021 


Db 


1838 


Qy 


1081 


Db 


1898 


Qy 


1141 


Db 


1958 



ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

Mlllllllllllllllllll III III III II II Mill llllllll II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 15 01 
GCCC^CTCTTCCTCTGAC^GCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

1 1 1 i 1 1 1 MM llllllll IIIIIIIIII II Mill II IIIIIIIIII I 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 llllllllllllll llllllll III 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 
ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

'II I M I ; 1 1 1 1 1 1 i M 1 1 M 1 1 1 1 1 1 Mill I ' 1 1 , 1 1 M 1 1 1 1 1 i 1 1 1 M 1 1 1 1 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1681 

GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

II llllllll IMMIMMMM IIIIIIIIII 
GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1717 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

M ! 1 1 1 1 1 1 1 1 1 1 1 llllllllllllll llllllll MMIMMIMM II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 
GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II MIIIIMIIIIIIIIMI llllllll llllllll llllllll llllllll 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

Mill IMIMMIMIMIMIM llllllllllllll IIIIIIIIIII llllll 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 18 97 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

M I M M I II I I I M II I II I I II I I II I II I I M I I II . M I I I I I I I i I I I I I I I I I 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 

AAGCCCAAGGTGGAGC^GCTGTCCAACATGATCGTGCGTTCCTGC^AGTGCAGCTGA 1197 

1 1 1 1 1 1 1 M I M I M 1 1 1 1 1 1 1 1 1 1 M M 1 1 II 1 1 1 1 1 1 1 1 1 'I II I Ml , II 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2 014 



RESULT 3 
AAQ20289 

ID AAQ20289 standard; cDNA; 1559 BP. 
XX 

AC AAQ2 028 9; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 

DE Sequence encoding simian transforming growth factor (TGF) beta-1. 
XX 

KW Hypertension therapy; hypotensive agent; blood pressure modulator; 

KW ss . 

XX 



OS 


Monkey . 




XX 






FH 


Key 


Locat ion/ Qual i f iers 


FT 


CDS 


262 282 


FT 




/*tag= a 


FT 


sia np'nritip 


283 . . 324 


FT 




/*tag= b 


FT 


CDS 


325 1098 


FT 




/*tag= c 


FT 


mat npnfirie 


1099 1436 


FT 




/*tag= d 


XX 






PN 


W09119513-A. 




XX 






PD 


26-DEC-1991 . 




XX 






PF 


20-JUN-1991; 


91WO-US04449 . 


XX 






PR 


20-JUN-1990; 


90US-0541221 . 


XX 






PA 


(BRIM ) BRISTOL-MYERS SQUIBB CO. 


XX 






PI 


Oleson FB, 


Comer eski CR; 


XX 






DR 


WPI; 1992-024199/03. 


DR 


P-PSDB; AAR20124. 


XX 






PT 


Use of transforming growth factor (TGF) -beta and their 


PT 


antagonists 


- for modulating blood pressure, for treating 


PT 


hypertension and hypotension 


XX 






PS 


Disclosure; 


Fig 1; 42pp; English. 


XX 






CC 


A new method 


for treating hypertension comprises administering a 


CC 


transforming 


growth factor (TGF) -beta to an individual at a dose 


CC 


effective for lowering blood pressure; the TGF-beta may be e.g. 


CC 


mature TGF-beta, TGF-beta2, a mature TGF-betal/beta2 hybrid, TGF- 


CC 


betal precursor, a latent TGF-beta2 precursor, hybrid TGF-beta 1 /TGF - 


CC 


beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 


CC 


complex . 




CC 


(Updated on 


25-MAR-2003 to correct PA field.) 


XX 






SQ 


Sequence 1559 BP; 300 A; 546 C; 446 G; 267 T; 0 Other; 



Query Match 79.9%; Score 956.6; DB 13; Length 1559; 

Best Local Similarity 88.5%; Pred. No. 4e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

Ml I I I I M lllllllllll lllllllllllll IIIIMIIIIIIII II III 
Db 261 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

Mlllllll I M I II II I II II Mill IMIMMIMIM f I I 1 f I I I I I I I I I I 
Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 38 0 



Qy 121 GTGAAG CGGAAG CG CATCGAGG C CATT CG CGG C CAG ATT CTG TC CAAG CTTCGGCTTGCC 18 0 



IIIIIIIIIIIIIIIIIIIII 1 1 1 1 lllllllllll lllllllllll Mill Ml 

1 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 



Db 


381 


GTGAAG CGGAAG CG CAT CG AG AC CAT C CG CGG C CAGA T C CTGT C CAAG CTG CGG CTCG CC 


440 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 


240 






IIIIIIMIIIIIIIIIIII h 1 ill 1 III MM Mill II II II II 




Db 


441 


AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


500 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


300 






llllllll llllllllllllll llllllll MM II lllllllllll II III 




Db 


501 


TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 


560 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


360 


Db 


561 


II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


620 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 






Mill II llllll 1 1 1 llllll llllllll 1 MMMM II llllll 




Db 


621 


TATGACTWVGTTCAAGCAGAGC^CACAC^^ 


680 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


480 






II Mill II II llllllll llllllll llllllllllllll IMMMMMI 




Db 


681 


CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


740 


Qy 


481 


AAGTTAAAAGTGGAGCAGCA.CGTGGAGCTATAC(^GAAATACAGCAATGATTCCTGGCGC 


540 






IMMMMM llllllll MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II llllllllll 




Db 


741 


AAGTTAAAAGTCGAG(^G(^TGTGGAGCTGTAC(^GAAATACAGC^CAATTCCTGGCGA 


800 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MM MMMM MM IIIMIIII 




Db 


801 


TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 


860 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


660 






1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 III MM Ml II II Mill llllllll II 




Db 


861 


ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


920 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACA 


720 






Mill MM IMMMMMMMMMM II II 1 II II MMMM 1 




Db 


921 


GC CCA CTGCTCCTGTGACAGCAAAGATAACAC^ 


980 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 


780 






1 MMMM IMMMMM Mill llllllllllllll llllllll III 




Db 


981 


ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


1040 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 


840 






II Mill IMIIIIMII llllllll II Mill llllll MUM 1 1 1 II Mill 




Db 


1041 


ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 


1100 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 


900 


Db 


1101 


II M MM MM II M llllllllll 

GACACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 


1136 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 


960 






1 1 II II 1 1 II II II 1 1 1 II IMMMMM llllllll llllllllllllll II 




Db 


1137 


TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


1196 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 


1020 



I MM llllllllll Mill llllllllllllllll IIIMIIII I llllllll 



Db 



1197 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 12 56 



Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

Mill M i I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1! 1 1 MM I lllllllllll MINI 

Db 12 57 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1316 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

1 1 1 II I II I I II M I M M M M M MM 1 1 M I M 1 1 M 1 1 Ml I M 1 1 1 1 1 M I II 

Db 1317 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 1376 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

I Ml I M I II I M M I M 1 1 II I II I II II I Ml I II llllllll lllllllll 

Db 13 77 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 1433 



AAQ03268; 

25-MAR-2003 (updated) 
12-AUG-1990 (first entry) 

Simian transforming growth factor-beta cDNA . 
Transforming growth factor-beta; psoriasis; TGF-beta; ss . 
Monkey . 



RESULT 4 
AAQ03268 

ID AAQ03268 standard; DNA; 1561 BP. 
XX 
AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 



Key 

sig_peptide 
mat_peptide 



EP353772-A. 
07-FEB-1990 . 

04- AUG-1989; 

05- AUG-1988; 



Locat ion/ Qual i f i ers 
283 . .324 
/*tag= a 
1096 . . 1431 
/*tag= b 

/product =human transforming growth factor-beta 



89EP-0114458 . 
88US-0229133 . 
(ONCO ) ONCOGEN LP. 

Twardzik DR, Purchio AF, Ranchalis JE, Stevens V; 

WPI; 1990-038499/06. 
P-PSDB; AAR03743. 

Inhibition of proliferation of epidermal cells - 
used to treat psoriasis by contacting cells with compositions 
containing transforming growth factor-beta. 

Disclosure; fig 1; 20pp ; English. 



XX 

CC TGF-beta may be used in the treatment of hyperplasia 

CC associated with acanthosis -categorised skin diseases, and 

CC in alleviating psoriatic symptoms associated with cytokine- 

CC induced phenomena. See also AAQ03269 and AAR03750. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

XX 

SQ Sequence 1561 BP; 301 A; 547 C; 446 G; 267 T; 0 other; 



Query Match 79.9%; Score 956.6; DB 11; Length 1561; 

Best Local Similarity 88.5%; Pred . No. 4e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III MM II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M III 

Db 2 62 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 321 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

MINIMI IIMMMIMM MIM MMIMMIMM i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 322 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 3 81 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 MM M MIM Ml 

Db 3 82 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 441 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill II II M II 

Db 4 42 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 501 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II M 1 1 MM II 1 1 M I If 1 1 1 1 II III 

Db 5 02 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 561 

Qy 3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAAT^ 3 60 

M 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1! I I MM Mill 

Db 562 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 621 

Qy 3 61 TATGATAAATTO^GGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

Mill II MUM I I I MINI f 1 1 1 1 1 1 1 I llllllll II MUM 

Db 622 TATGA CAAGTT CAAG CAGAG CA CA CA CAG CATATATATGTTCTT CAA CA CATCAG AG CT C 681 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGG CAGAG CTGCGCCTGCTGAGGCTC 48 0 

II Mill II II IIIIIMI llllllll MMIMMIMM M MM M M MM 

Db 682 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 741 

Qy 4 81 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 M M IMMMIII 

Db 742 AAGTTAAAAGTGGAGCAGC^TGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 8 01 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

1 1 II 1 1 II 1 1 II M I II II 1 1 II Mill MM I f 1 1 1 1 1 1 1 MM lllllllll 

Db 8 02 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 8 61 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III MM III II II Mill llllllll II 

Db 8 62 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 921 



Qy 

Db 



661 GCCCACTCTTCCTCTGAC^GC^WVGATAAC^C^CTCCACGTGGAAATTAACGGGTTC^T 

Mill 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 ii 1 1 : i ii 1 1 ii iiiii ii iiiiiiiiii I 

922 GCCCACTGCTCCTGTGACAGCAAAGATAAC^CACTGCAAGTGGACATCAACGGGTTCACT 



720 
981 



Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I llllllll I I I I I I I I I I I I I IIIII I I I ' I I I II I I I I I llllllll Ml 
Db 982 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1041 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

iiiiiiiiiii 1 1 j 1 1 1 1 r j r 1 1 1 1 ii iiiii , i M in : 1 1 1 1 1 1 1 1 ii 1 1 1 mi 

Db 104 2 ATGGCCACCCC^CTGGAGAGGGCCC^CATCTGCA^GCTCCCGGCACCGCCGAGCCCTG 1101 

Qy 841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

II lllllllllllllllllllllll Illlllllll 
Db 1102 GACACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 113 7 

Qy 901 TG CTG CGTG CGG CAG CTCTACATTGA CTT C CGGAAGGA C CTGGG CTGGAAGTGGATT CAT 960 

1 1 1 1 1 , 1 1 1 M 1 1 1 1 i II IIIIIIIIIII llllllll llllllllllllll II 

Db 113 8 TG CTG CGTG CGG CAG CTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1197 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II llllllllllllllllllll llllllllllllllllllllllllll llllllll 
Db 1198 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 1257 

Qy 1021 GA(^CT(^GTA(^GCAAGGTCCTGGCTCTGTA(^ACCAGCACAACCCGGGCGCGTCGGCG 108 0 

IIIII llllllllllllllllllll llllllllllllll IIIIIIIIIII MINI 
Db 1258 GACACGCAGTAC^GC^AGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1317 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

I! II II I II II I II I M 1 1 II I M 1 1 1 1 1 II III 1 1 M I II MM I II M 1 1 III MMM 

Db 1318 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 13 77 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 M 1 1 1 1 M 1 1 1 1 1 1 M Ml I MM I i 1 1 1 1 1! M M , llllllll lllllllll 

Db 1378 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAATGCAGCTGA 14 34 



RESULT 5 




AAQ13392 




ID 


AAQ13392 standard; DNA; 1821 BP. 


XX 






AC 


AAQ13392; 




XX 






DT 


20-NOV-1991 


(first entry) 


XX 






DE 


Human pro-TGF 


-beta 1 gene. 


XX 






KW 


Osteogenetic; 


tumoricidal ; ss. 


XX 






OS 


Homo sapiens . 




XX 






FH 


Key 


Location/Qual if iers 


FT 


CDS 


512 . . 1684 


FT 




/*tag= a 


FT 


sig_peptide 


512 . . 598 


FT 




/*tag= b 


FT 


misc RNA 


599. .1684 



FT /*tag= c 

FT /note= 11 pro -TGF- bet a 1" 

FT mat_peptide 1346.. 1684 

FT " /*tag= e 

FT /note= "TGF-beta 1" 

XX 

PN JP03180192-A. 
XX 

PD 06-AUG-1991. 
XX 

PF 07-DEC-1989; 8 9 JP- 03 18243 . 
XX 

PR 07-DEC-1989; 89JP-0318243 . 
XX 

PA (KIRI ) KIRIN BREWERY KK. 
XX 

DR WPI; 1991-271579/37. 
DR P-PSDB; AAR13 813. 
XX 

PT Human pro-TGF-beta 1 prodn . , for osteo-genet ic activity - by 
PT preparing DNA chain contg . base sequence coding for human 
PT pre:pro-TGF-beta 1, forming expression vector etc. 
xx 

PS Claim 1; Fig 1; 16pp; Japanese. 
XX 

CC The DNA sequence encodes human prepro-TGF-beta 1 which can be 
CC produced by recombinant methods, it has osteogenetic and 
CC tumoricidal activity. 
XX 

SQ Sequence 1821 BP; 326 A; 679 C; 508 G; 308 T; 0 other; 



Query Match 79.9%; Score 956.6; DB 12; Length 1821; 

Best Local Similarity 88.5%; Pred. No. 4.1e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III I I I I II I MM 11111,11:1111 MINIUM II III 
Db 512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

lllllllllllll MINIMI Mill : 1 1 - 1 1 1 1 1 1 ' I ; I 1 1 II II 1 1 1 Mill 

Db 572 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 

Qy 121 G TG AAG CGGAAG CG CAT CGAGG CCATTCGCGGC CAGAT TCTGT C CAAG CTT CGG CTTG C C 18 0 

MIMIMIIMMMMIMIMM IMMIIMM IIMMIMM 1 1 1 1 1 III 

Db 632 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTGGCC 691 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

1 1 II h M I II I M I I I 1 1 1 1 1 M I II I I II I M 1 1 I 1 1 I I II II M II 

Db 6 92 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 751 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

MINIM IMIIMIMIIM lllllll 1 1 1 1 MINIM MM III I IM 

Db 752 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 811 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II INN III lllllll IMIMMMI IMIIIIIIIIIII II I 1 1 1 1 Mill 



Db 


812 


GCCGACTACTACGCCAAGGAGGTCA.CCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


871 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 


Db 


872 


Mill II MINI 1 1 IMIII MMMM 1 lllllill II MUM 

TATGAC^GTTC^GCAGAGTACAC^C^GCATATATATGTTCTTC^ACACATCAGAGCTC 


931 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II MMMM M Mill II MMMM MIMMIMIMI 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


932 


991 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

!l 1 1 1 1 1 1 1 1 1 1 ! Ml 1 1 1 1 II 1 1 1 ! l 1 1 II 1 i II 1 III Ml ! M MIIIMIM 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


540 


Db 


992 


1051 


Qy 

Db 


541 
1052 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

1 II II 1 1 II II 1 1 II II 1 1 II II Mill Mill II IMIII 1 II lllllill 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


600 
1111 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


660 


Db 


1112 


MIMMMMIMMIMM Ml Ml Ml M II MMI MMMM II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


1171 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

lllllll MM MMMM 1 1 II 1 II II M M Mill M IMIIIIMI 1 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


720 


Db 


1172 


1231 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTGCTC 

1 MMMM MM ' Mill II, MIMMIMIMI MMMM III 

ACCGGCGGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


780 


Db 


1232 


1291 


Qy 

Db 


781 
1292 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

1 II MM IM 1 1 1 1 1 1 II 1 1 Ml 1 1 1 II Mill Ml M 1 II 1 1 1 II 1 1 1 MM 1 1 M 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGGCGAGCCCTG 


840 
1351 


Qy 


841 


GATAGCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

II lllllill MIMMIMIMI MIIIMIM 

GACACCAACTATTGCTTCAGCTGCAC GGAGAAGAAC 


900 


Db 


1352 


1387 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

1 1 1 1 1 II 1 1 M 1 1 1 1 II 1 r 1 1 1 1 1 1 1 1 1 1 ! 1 MMMM IIIMIMMIMI II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 


Db 


1388 


1447 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

M MIMMIIIMIIMMM MMMM MMMM MMMM MMMM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 


1020 


Db 


1448 


1507 


Qy 


1021 
1508 


GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGC^CAACCCGGGCGCGTCGGCG 

Mill MIMIMIMIMIIMM MIMMIMIMI lllllllllll IMIII 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 


1080 
1567 


Qy 

Db 


1081 
1568 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 

1 Ml 1 1 II M 1 1 1 1 II II 1 II II Ml II MM 1 1 M MM MM M III 1 1 1 II 1 1 1 II 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 


1140 
1627 


Qy 


1141 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II M 1 II 1 II 1 1 1 II 1 M M 1 1 II III 1 1 1 II 1 1 II 1 1 1 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 16 84 


Db 


1628 



RESULT 6 
AAN60972 

ID AAN60972 Standard; CDNA; 2537 BP. 
XX 

AC AAN60972; 
XX 

DT 31-OCT-2002 (updated) 

DT 28-OCT-1991 (first entry) 

XX 

DE Sequence encoding preTGF-beta. 
XX 

KW Transforming growth factor beta; cancer; wound healing. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 
FT misc_structure 37.. 113 
FT /*tag= a 

FT /note= "Sequence can form stable hairpin loops" 

FT CDS 842.. 2014 

FT /*tag= b 

FT mat_peptide 1676.. 2011 

FT /*tag= c 

XX 

PN EP200341-A. 
XX 

PD 10-DEC-1986. 
XX 

PF 21-MAR-1986; 86EP- 03 021 12 . 
XX 

PR 22-MAR-1985; 85US- 0715142 . 
PR 13-MAR-1987; 87US- 0025423 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA; 
XX 

DR WPI; 1986-326875/50. 
DR P-PSDB; AAP61468. 
XX 

PT TGF-beta prodn. from transformed hosts - useful esp. for treating 

PT wounds (J6 2/9/86) . 

XX 

PS Disclosure; Fig lb; 26pp; English. 
XX 

CC The gene product is known to stimulate cell proliferation and 
CC inhibit anchorage-dependent growth of a variety of human cancer cell 
CC lines, it is esp. useful in treatment of burns and the promotion of 
CC surface and internal wound healing. TGF-beta may be expressed from a 
CC transformed CHO cell line. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 



79.9%; Score 956.6; DB 7; Length 2537; 



Best Local Similarity 88.5%; Pred . No. 4.3e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III I I I I II lllllllllll ! I I I I II I I I i I I I I > I 1 I ' I I ' I I M I | | 

ATG CCG C CCT CCGGG CTG CGG CTG CTG C CG CTG CTG CTAC CG CTG CTGTGG CTACTGGTG 901 

CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

lllllllllllll IIMIIIII Mill llllllllllllll 1 1 1 M 1 1 1 1 1 1 1 1 i I 

CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 
GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

IIIIIIIIIIIIIIIIMIIMMII lllllllllll IMIIIIMII Mill Ml 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

MMIMMIMMMIMI 1 1 1 1 1 1 ! 1 1 1 1 Mill! 1 1 Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 0 0 

MIIIMI llllllllllllll 1 1 1 1 1 1 1 1 MM IIIIIIMIIIIIIMI III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 36 0 

II 1 1 : T I ! M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 II I II 1 1 M 1 1 1 I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 12 01 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

Mill M MUM I I MUM I II I II II I 1 1 1 1 , 1 1 1 II llllll 

TATGACAAGTTC^GC^GAGTACAC^CTVGC^TATATATGTTCTTCAACACATCAGAGCTC 1261 
CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

II f 1 1 1 1 1 1 1 II Mill II MIIIMI III I II 1 1 1 1 1 1 1 i IIMIIIIIIII 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

1 1, L " l| , 1 1, 1 1 , | ; | ; | iiiiiM iMMii in | , 1 1 1 1 1 1 1 1 M I II I 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 13 81 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 6 0 0 

1 1 1 1 1 1 ! i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill M llllll I II MINIMI 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

M 1 1 1 > I M 1 1 II I M 1 1 1 1 Ml Ml Ml M M Mill MIIIMI II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501 
GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

I MM I i MM MIIIMI lllllllllll II Mill II IIIIIM I Ml 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 
TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 7 8 0 

I MIIIMI I II I M I M II I II M 1 1 M MMIMMMMI IMIMM Ml 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 
781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 



Qy 


i 


Db 


842 


Qy 


61 


Db 


902 


Qy 


121 


Db 


962 


Qy 


181 


Db 


1022 


Qy 


241 


Db 


1082 


Qy 


301 


Db 


1142 


Qy 


361 


Db 


1202 


Qy 


421 


Db 


1262 


Qy 


481 


Db 


1322 


Qy 


541 


Db 


1382 


Qy 


601 


Db 


1442 


Qy 


661 


Db 


1502 


Qy 


721 


Db 


1562 


Qy 


781 



Db 



1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1681 



Qy 841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

1 1 MINIM IMMMMMMI M I II M M I 

Db 1682 GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1717 

Qy 901 TG CTG CGTG CGG CAG CTCTACATTGA CTT C CGG AAGGA C CTGGG CTGG AAG TGGATTCAT 960 

1 1 1! 1 1 II 1 1 1 1 1 1 1 1 1 MIIIIIMIIMI M 1 1 1 1! I II M 1 1! 1 1 1 1 1 1 1 M 

Db 1718 TGCTG CGTG CGG CAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

1 1 IIIIIIIIIIIIIIIIIIII IIIIMII llllllll llllllll llllllll 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTACAGGAAGGTCCTGGCTCTGTACAACC^GCACAACCCGGGCGCGTCGGCG 108 0 

Mill 1 1 M M I M II I M II I M I II M I M II 1 1 M I I II 1 1 1 1 M M IMMI 

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 18 97 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 II M 1 1 II 1 1 1 1 1 1 1 1 1 I ! 1 1 1 1 1 < 1 1 1 ! 1 1 1 1 1 1 1 1 < I ! . 1 1 

Db 1898 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 

Qy 1141 AAG C CCAAGGTGGAG CAG CTGTCCAACATGAT CGTG CGTTC CTG CAAGTG CAG CTGA 1197 

M I f M 1 1 1 1 1 II I II 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 1 II lllllllllllillllll 

Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2 014 

RESULT 7 
AAQ03301 

ID AAQ03301 standard; DNA; 2537 BP. 
XX 

AC AAQ03301; 
XX 

DT 25-MAR-2003 (updated) 

DT 05-AUG-1990 (first entry) 

XX 

DE cDNA encoding human pre- transforming growth factor-beta - 1 (pre-TGF-beta- 

DE 1) . 

XX 

KW Transforming growth f actor-beta-1 (TGF-beta-1) ; 

KW neoplastic cell line inhibition; 

KW EGF-potentiated anchorage -independent growth; 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 842 . .2014 

FT /*tag= a 

FT mac_peptide 1676.. 2011 

FT /*tag= b 

FT misc_dif ference 37.. 113 

FT /*tag= c 

FT /note="stable hairpin loops" 

FT misc_feature 2015.. 2100 

FT /*tag= d 

FT /note="G-C rich sequence 

FT and a downstream TATA- like sequence" 



XX 

PN US4886747-A. 
XX 

PD 12-DEC-1989. 
XX 

PF 13-MAR-1987; 8 7US - 0 02 5423 . 
XX 

PR 13-MAR-1987; 87US- 0025423 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR P-PSDB; AAR05258. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC It was obtained by an analysis of several overlapping cDNAs and gene 

CC fragments, leading to the detn. of a continuous sequence corresp. to the 

CC TGF-beta-1 precursor mRNA. It is useful in constructing vectors that 

CC encode biologically active transforming growth factor (TGF-beta) , 

CC operably linked to DNA that encodes a secretory leader (SL) . It, or a 

CC nucleic acid capable of hybridising with it, can also be labelled and 

CC used in diagnostic assays for DNA or mRNA encoding TGF-beta or related 

CC proteins. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 79.9%; Score 956.6; DB 11; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 4.3e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM M MMIMMM 1 1 1 1 1 M ! I II M MMMMMMM M Ml 

Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

1 1 1 11 1 1 1 1 1 1 1 1 MMMMI MMI MMMMMMM IMMMMMMM 

Db 9 02 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 



Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

I M M M I M I M I M I M I M I II M I II M I II 1 1 1 1 1 MMI M II II M 

Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

MMMM MMMMMMM MMMM MM 1 1 II M 1 1 1 M 1 1 1 1 1 1 III 

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 



Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
ii i i i i i i i i i i i i i i i i i i i i i i i i i i i i 1 i i i i i i i i i i i i i i i i i i i i i i i i 

GCCGACTACTACGCCAAGGAGGTO\CCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


1142 


1201 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 


Db 


1202 


Mill M 1 1 II 1 1 1 1 1 1 1 1 1 1 llllllll 1 llllllll II 1 1 1 1 II 
i i i i i ii i i i t i i i i i i i i i i iiiiiiii i iiiiiiii ii i i i i i i 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


480 


Db 


1262 


M llllllll M Mill II llllllll llllllllllllll 1 1 1 1 1 1 1 1 1 II 1 

II llllllll 11 1 1 1 1 1 II llllllll llllllllllllll 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


1321 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 


Db 


1322 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


1381 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 Mill II 1 II II 1 1 II II II II 1 1 1 

1 1 1 1 l 1 1 1 1 1 1 1 1 l 1 l 1 l 1 1 1 1 1 l l 1 1 1 1 1 1 1 1 1 I 1 1 l 1 1 1 1 II M II II 1 1 1 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


600 


Db 


1382 


1441 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


660 


Db 


1442 


II M II II 1 1 1 1 M 1 1 II 1 II Ml Ml Ml M II Mill II II II II M 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


1501 


Qy 


661 


GCCC^CTCTTCCTCTGAC^GCT^GATAACACACTCC^CGTGGAAATTAACGGGTTCAAT 

II 1 II II MM II II II 1 1 II 1 II II II 1 1 M Mill II 1 II II II 1 II 1 

1 1 1 1 1 1 1 1 1 1 1 llllllll 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


720 


Db 


1502 


1561 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 
1 llllllll 1 1 II 1 1 1 II II II 1 II 1 II 1 II II II II II 1 II 1 llllllll III 


780 


Db 


1562 


i iiiiiiii i i i i i i i i i i i I I i I I i I I I llllllllllllll llllllll III 
ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


1621 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 


840 


Db 


1622 


II M M 1 M 1 II II II II 1 II II M II II Mill M II II II M II M M II M M M 

1 1 1 M II 1 1 II II 1 1 II t II II 1 | | II | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 


1681 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

II II II II II II II II 1 1 1 M 1 II 1 1 1 II 1 II 1 1 
ii iiiiiiii iiiiiiiiiiiiii i i I 1 I l l l i I 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1682 


1717 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 
1 II II II II II II II II 1 II 1 1 1 II II 1 1 II 1 1 II II 1 1 1 II 1 1 II II 1 II II II 

1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllllllllll llllllll llllllllllllll 11 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 


Db 


1718 


1777 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 
ii i i i i i i i i i i i i i i i i i i i i iiiiiiii iiiiiiii iiiiiiii iiiiiiii 


1020 


Db 


1778 


II IMMIMMIMMIMM MMMM MMMM MMMM MMMM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTOGAGCrTG 


1 837 


Qy 


1021 


GACACTCAGTACAGOVAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 


1080 


Db 


1838 


Mill II II M M II II 1 M II 1 M llllllllllllll 1 M II M II M MUM 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 


1897 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 

IIIIIIIIIIIIIIIIIIIMMIIIIIMM IIIIIIIIIIIIIIIIIMIIIIIIM 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 


1140 


Db 


1898 


1957 



Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

MIIIIIMIMMIIMIMMMMIIIIIMIMI M I M 1 1 1 1 M I II 11 II 

Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 



AAQ02814; 

25-MAR-2003 
31-OCT-2002 
31-MAY-1989 



(updated) 
(updated) 
(first entry) 



Sequence of pre-TGF-betal cDNA. 

Transforming growth factor beta-3 (TGF beta 3); tumour cells; growth 
inhibition . 



RESULT 8 
AAQ02814 

ID AAQ02814 standard; CDNA ; 2537 BP. 
XX 
AC 
XX 
DT 
DT 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 



Homo sapiens . 

Key 
CDS 



CDS 

GCjsignal 
misc_f eature 

stem_loop 
misc_f eature 

WO8912101-A. 
14-DEC-1989 . 
08-JUN-1988; 
08-JUN-1988; 



Lo ca t i on / Qua 1 i f l e r s 
842 . .2011 
/*tag= a 

/label=pre-TGF beta 1 
1677. .2011 
/*tag= b 

/label=mature TGF-beta 1 
2015. .2092 
/*tag= c 
2093 . .2099 
/*tag= d 

/ label =TATA- like sequence 
37. .113 
/*tag= e 
863. . 911 
/*tag= f 

/ label hydrophobic domain 



88WO-US01945. 
88WO-US01945. 



(GETH ) GENENTECH INC. 

Dernyck RMA, Goeddel DV; 

WPI; 1990-007474/01. 
P-PSDB; AAR04034. 

Nucleotide sequence encoding transforming growth factor beta-3 -used as a 
probe, or to produce TGF beta 3, for growth inhibition of certain normal 



PT and neoplastic cells, eg A549. 
XX 

PS Disclosure; Fig. lb; 61pp; English. 
XX 

CC Sequence encodes the 390 amino acid (AA) precursor transforming growth 

CC factor-beta 1 (pre-TGF-beta 1) polypeptide. The 5' untranslated region of 

CC the TGF-beta 1 mRNA is 841 bases long, is purine rich and has a region of 

CC potential secondary structure. The TATA- 1 ike sequence in the 3' untrans- 

CC lated region of the gene is presumably a polyadenylation signal. Mature 

CC TGF-beta 1 comprises the C-terminal 112 AA's of pre-TGF-beta 1 and is 

CC cleaved at the Arg-Arg dipeptide preceding its NH2 terminus. The nucleic 

CC acid encoding the second subtype of TGF-beta (TGF-beta 3) is useful as a 

CC probe or to produce TGF-beta 3 for inhibition of growth of normal and 

CC neoplastic cells. 

CC (Updated on 31-OCT-2002 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PR field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 
XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 79.9%; Score 956.6; DB 11; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 4.3e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 





1 


III INI 1 1 lllllllllll lllllllllllll llllllllllllll II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 


60 


Db 


842 


901 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 

lllllllllllll lllllllll Mill llllllllllllll lllllllllllllll 

CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


120 


Db 


902 


961 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

M 1 1 ' 1 1 II 1 M 1 II 1 II 1. M 1 1 1 1 1 MIIIIIIIM lllllllllll Mill Ml 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


180 


Db 


962 


1021 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

1 1 ! II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill lllllllll Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


240 


Db 


1022 


1081 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

MIMIII llllllllllllll MIMIII MM lllllllllllllllll III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


300 


Db 


1082 


1141 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGCAACC^lAATC 

II IMIIIIIIIIMIIIMIIIIIMIIIMMIIIIMMIII 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


1142 


1201 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II llllll 1 1 llllll MIMIII 1 II II 1 M 1 II IIMII 

TATGACAAGTTCAAGCAGAGTACACACAGC^ 


420 


Db 


1202 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II MIMIII II Mill II MIMIII llllllllllllll llllllllllll 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


1262 


1321 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 



llllllll MIIIIIIIIIIIIIMIIII J 1 1 . 1 1 1 1 1 1 [ 1 1 1 1 1 Mlllllill 

Db 1322 AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 13 81 



Qy 


541 


Db 


1382 


Qy 


601 


Db 


1442 


Qy 


661 


Db 


1502 


Qy 


721 


Db 


1562 


Qy 


781 


Db 


1622 


Qy 


841 


Db 


1682 


Qy 


901 


Db 


1718 


Qy 


961 


Db 


1778 


Qy 


1021 


Db 


1838 


Qy 


1081 


Db 


1898 


Qy 


1141 


Db 


1958 



TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 60 0 

1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill M MMM I II IMMMM 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 

ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III III III II M Mill llllllll M 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 15 01 

GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

1 1 1 1 1 1 1 MM llllllll 1 1 1 1 1 1 1 II I ! II Mill II Mlllllill I 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 

TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I llllllll IMIIIIMIIIIIIIIIII llllllllllllll llllllll III 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 

ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

MMMMIMMMMMMIMIMM Mill II 1 1 1 II 1 1 II I II 1 1 1 II II 1 1 1 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1681 

GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 900 

ii iiiMiii 1 1 1 1 1 1 1 1 1 1 1 r 1 1 Minimi 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1717 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMMIMI llllllll 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II I ' llllllll llllllll llllllll llllllll 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

Mill IIIIIIMIIIIIIIIMII i r 1 1 1 1 1 1 1 1 1 1 1 1 I M I II 1 1 1 1 1 MMM 

GACACGC^GTACAG(^GGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

M M M I II I M I M I M II I II I I II II I I I MIMIIMIIIMIMIIIMIIIII 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

r r r 1 1 1 1 1 1 1 1 1 1 j i r 1 1 r 1 1 1 r j r 1 1 1 r 1 1 1 1 1 1 1 1 r immmmmmimm 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2014 



RESULT 9 
AAT15720 

ID AAT15720 standard; cDNA; 2537 BP. 
XX 

AC AAT15720; 
XX 

DT 25-MAR-2003 (updated) 
DT 24-JUL-1997 (revised) 
DT 25-JAN-1980 (first entry) 



XX 






DE 


Pre- trans forming growth factor beta 1 cDNA. 


XX 






KW 


transforming growth factor beta 1; wound healing; 


KW 


recombinant production; ss. 


XX 






OS 


Homo sapiens . 




XX 






FH 


Key 


Location/ Qual if iers 


FT 


5 1 UTR 


1 . .841 


FT 




/*tag= a 


FT 


misc feature 


37 . .113 


FT 




/*tag= d 


FT 




/note= "GC-rich region forms stable hairpin loops; 


FT 




similar to structural organ i sat ion of c -myc RNA 


FT 




could play role in mRNA stability or in 


FT 




regulation of transcription" 


FT 


CDS 


842 . .2014 


FT 




/*tag= c 


FT 




/product^ pre-TGF_beta_l 


FT 


mat_pept ide 


1676 . . 2011 


FT 




/*tag= a 


FT 




/product = mature_TGF_beta_l 


FT 


repeat region 


2015 . . 2100 


FT 




/*tag= e 


FT 




/note= "GC-rich region; possibly responsible for the 


FT 




fact 3 1 UTR ot mRNA could not be cloned as cDNA; 


FT 




may be important for transcription efficiency" 


FT 


repeat_unit 


2019 . .2023 


FT 




/*tag= f 


FT 


TATA_signal 


2094 . .2100 


FT 




/*tag= g 


FT 




/note= "TATA- like sequence; no evidence that this 


FT 




functions a promoter" 


FT 


polyA_signal 


2514 . .2520 


FT 




/*tag= h 


FT 


misc_signal 


2529 . . 2536 


FT 




/*tag= i 


FT 




/note= "consensus sequence immediately precedes 


FT 




polyA-tail (Benoist et al) " 


XX 






PN 


US5482851-A. 




XX 






PD 


09-JAN-1996. 




XX 






PF 


05-NOV-1993; 


93US-0147364 . 


XX 






PR 


13-MAR-1987; 


87US-0025423 . 


PR 


22-MAR-1985; 


85US-0715142. 


PR 


04-AUG-198 9; 


89US-0389929 . 


PR 


04-MAR-1992; 


92US-0845893 . 


PR 


05-NOV-1993; 


93US-0147364 . 


XX 






PA 


(GETH ) GENENTECH INC. 


XX 






PI 


Derynck RMA, 


Goeddel DV; 


XX 







DR WPI; 1996-076891/08. 

DR P-PSDB; AAR90827. 
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 

PT using Chinese hamster ovary cells, for use in diagnostic applications 

PT or in therapy 
XX 

PS Example 3; Fig 1; 2 6pp; English. 
XX 

CC The cDNA encodes the pre- transforming growth factor (TGF) beta 1 protein. 

CC The nucleotide sequence was obtd. by an analysis of several overlapping 

CC cDNAs and gene fragments. The DNA is useful for the recombinant 

CC production of TGF beta 1, which can be used in, e.g. wound healing. 

CC (Revised entry submitted to correct sequence analysis breakdown.) 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 2537 BP; 473 A; 893 C; 739 G; 432 T; 0 other; 



Query Match 79.9%; Score 956.6; DB 17; Length 2537; 

Best Local Similarity 88.5%; Pred. No. 4.3e-193; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III I I I I II lllllllllll I II ! I I ! I I I I I I I I I I . I : I I I > i I I II III 
Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 9 01 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

I Ml Ml MINI MIMMM I MM II I MM Mill II 1 1 M 1 1 II 1 1 ! 1 1 1 1 

Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllll MM MM Mill III 

Db 962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill M II II II 

Db 1022 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

MMMM MMMMMMM IIIIMM MM 1 1 1 1 1 1 1 1 M II 1 1 1 M Ml 

Db 1082 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 



Qy 3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 ! I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM Mill 

Db 114 2 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 1201 

Qy 361 TATGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

Mill II MMM I I MUM MMMM I MMMM !! !!!!!! 

Db 12 02 TATGACAAGTTCAAGCAGAGTACACACAGC^ 1261 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

II MMMM II Mill II MMMM 1 1 1 1 1 1 1 1 1 1 1 II I MMM 

Db 12 62 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321 

Qy 4 81 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

I M II II II I I I I I I I II II I I M II I M lllllllllllllllll I I I I I I I I I I 
Db 1322 AAGTTAAAAGTGGAGCAG^CGTGGAGCTGTACCAGAAATAO^GCAACAATTCCTGGCGA 13 81 



Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 600 

I i 1 1 i 1 1 1 1 1 II M 1 1 1 1 1 > 1 1 1 him Mill II MINI I II MINIMI 

Db 1382 TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 Ml III III II II Mill 1 1 1 1 1 1 1 1 II 

Db 1442 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 1501 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

IMMM MM MMMM MIMMMM II Mill M MMMIMI I 

Db 15 02 GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 1561 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I MMMM 1 1 1 II 1 1 1 1 1 M II 1 1 1 1 1 1 MIMMMIMM llllllll III 

Db 1562 ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 1621 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

IIIIIIIIIIIIMMIIIIIIIIIIIII Mill MMMMIMMIMMMIMI 

Db 1622 ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1681 

Qy 841 G ATA C CAACTA CTG CTT CAG CTCCACGGACTACAAGGATGACG A CGACAAGGAGAAGAAC 90 0 

II MMMM MIMMMIMM MMMIMI 

Db 1682 GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 1717 

Qy 901 TG CTG CGTG CGG CAG CTCTA CATTG A CTTC CGGAAGGA C CTGGG CTGGAAGTGGATT CAT 960 

1 1 ,' i U 1 1 1 1 h, I II I 1 , 1 M 1 1 1 1 1 1 M I MMMM MIMMMIMM II 

Db 1718 TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1777 

Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 1020 

II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM MMMM MMMM MMMM 

Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTAaG(^GGTCCTGGCTCTGTA(^CC^GC^CAACCCGGGCGCGTCGGCG 108 0 

Mill IIMIIIIIIIMIIIMM 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 MIMMMM llllll 

Db 183 8 GA(^CGCAGTA(^GC^GGTCCTGGCCCTGTAC^ACCAGCATAACCCGGGCGCCTCGGCG 18 97 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

1 1 1 1 1 ! 1 1 1 M 1 1 1 ! 1 1 , 1 1 1 1 1 1 1 1 1 ! 1 1 MIIMMIIIIIIIIIIIIIIIIIII 

Db 18 98 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 

Qy 1141 AAGCCC^GGTGGAGC^GCTGTCCT^C^TGATCGTGCGTTCCTGa^AGTGCAGCTGA 1197 

IMIMIIIIMMIMMMIIIIIIMIIIIIIIM IIIIMIIIIIIMIMI 

Db 1958 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2 014 



RESULT 10 
ABQ76674 

ID ABQ76674 standard; DNA; 2527 BP. 
XX 

AC ABQ76674; 
XX 

DT 26-MAR-2003 (first entry) 
XX 

DE Androgen receptor signalling pathway-associated DNA E00973. 
XX 

KW Androgen receptor; transact ivation ; modulator; Smad3 ; Smad4 ; Akt; TGF-B; 



KW signal transduction pathway; transforming growth factor-B; phosphatase; 

KW tensin; cytostatic; antiproliferative; cellular proliferation; cancer; 

KW E00973; ds . 
XX 

OS Synthetic. 
XX 

PN WO200282081-A2 . 
XX 

PD 17-OCT-2002. 
XX 

PF 05-APR-2002; 2 002WO-US1 1 086 . 
XX 

PR 06-APR-2001; 200 1US-282266P . 

PR 13-MAR-2002; 2 002US-365060P . 
XX 

PA (UYRP ) UNIV ROCHESTER. 
XX 

PI Chang C; 
XX 

DR WPI; 2003-046871/04. 
XX 

PT Modulating androgen receptor activity, by administering a compound that 

PT modulates receptor activity, inhibits receptor-signal transduction 

PT pathway/receptor-coactivator interaction or changes amount or receptor 

PT - 

XX 

PS Disclosure; Page 225-226; 302pp; English. 
XX 

CC This invention describes a novel method for modulating androgen receptor 

CC activity or androgen receptor-mediated transactivation activity in a 

CC cell. The method involves administering a compound which causes 

CC modulation of the androgen receptors activity and the inhibition of 

CC interaction between the receptor and a protein involved in a signal 

CC transduction pathway. The compound also inhibits the interaction between 

CC the androgen receptor and a protein selected from Smad3, Smad4 , Akt, 

CC transforming growth factor (TGF) -B and phosphatase and tensin homologues 

CC deleted on chromosome 10 (PTEN) or their fragments. The compounds of the 

CC invention have cytostatic and antiproliferative activity. The obtained 

CC composition is useful for treating any disease, where uncontrolled 

CC proliferation or cellular proliferation occurs such as cancer, e.g. 

CC prostate cancer. This sequence represents the androgen receptor 

CC transactivation signalling pathway modulator E00973 described in 

CC the method of the invention. 

XX 

SQ Sequence 2527 BP; 472 A; 888 C; 735 G; 432 T; 0 Other; 

Query Match 79.8%; Score 955; DB 25; Length 2527; 
Best Local Similarity 88.4%; Pred. No. 9.3e-193; 

Matches 1058; Conservative 0; Mismatches 115; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III 1 1 M II ! 1 1 1 1 1 1 1 1 1 1 lllllllllllll 1 1 1 1 1 1 1 II 1 1 1 II || Ml 

Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 MINIMI MMI MIIIIIIIIIMI II 1 1 1 1 1 1 1 1 1 1 II I 

Db 9 02 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 



Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllll lllllllllll Mill III 
i i ! ' i i i i i i i i i i i i i i i i i i i i i i iiiiMiiiii i i i i i i i i i I i iiiii i i i 

GTG AAG CGGAAG CG CAT CG AGG C CAT C CG CGG C CAGAT C CTGT C CAAGCTG CGG CT CGC C 


180 


Db 


962 


1021 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 


240 


Db 


1022 


II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 lllll II II II II 
AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTC 


1081 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

MINIM II 1 II 1 1 1 1 1 II 1 1 1 II II 1 II 1 1 1 1 1 M II II 1 1 II II II 1 1 Ml 

llllllll llllllllllllll llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


300 


Db 


1082 


1141 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGGAACCAAATC 


360 


Db 


1142 


M II 1 II 1 II II II II 1 1 II II II II 1 II 1 II 1 1 1 1 1 II II 1 1 II 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


1201 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

1 1 1 II II 1 1 II 1 1 1 1 1 II 1 1 1 llllllll 1 1 1 II II 1 1 II 1 1 1 II 1 
iiiii ii i i i i i i i i i i [ i i i iiiiiiii i iiiiiiii ii i i i i i i 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATA 


420 


Db 


1202 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


480 


Db 


1262 


II llllllll II lllll II llllllll 1 1 II II 1 M II II 1 II II II 1 M 1 1 1 
CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


1321 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAG CTATACCAGAAATACAGCAATGATTCCTGG CGC 


540 


Db 


1322 


1 1 II II II II II II 1 1 II II 1 1 II 1 1 II 1 1 II 1 1 II II II 1 II 1 II II II 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i iiifiiiiii 

AAGTTAAAAGTGGAG(^GC^CGTGGAGCTGTAC(^GAAATAC^GCAACAATTCCTGGCGA 


1381 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


600 


Db 


1382 


II II 1 1 1 1 M II II 1 II M 1 II 1 lllll lllll II 1 M 1 II 1 II M 1 II II 1 1 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


1441 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

1 1 II 1 II II 1 1 1 II 1 II II II III III III II II Mill llllllll M 
ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


660 


Db 


1442 


1501 


Qy 


661 


GCCCACTCTTCCTCTGAOVGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 


720 


Db 


1502 


1 1 II 1 1 1 II 1 1 llllllll II II II M II 1 II lllll II II 1 II II 1 1 1 1 
i i i i i i i i i i i iiiiiiii iiiiiiiiiii ii i i i i 1 ii iiiiiiiiii i 

GCCCACTGCTCCTGTGACAGC^GGGATAAC^C^ 


1561 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 llllllll 1 1 1 II 1 1 II II II II 1 II II llllllllllllll II II II II IN 
i iiiiiiii i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i iiiiiiii mi 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


780 


Db 


1562 


1621 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

II II II 1 II 1 II 1 II 1 1 II 1 II 1 II 1 1 II Mill MMIMMMMMIMMIMI 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCnCCGAGCCCTG 


840 


Db 


1622 


1681 


Qy 


841 


GATACCAACTACTGCTT(^GCTC(^CGGACTAC^AGGATGACGACGACAAGGAGAAGAAC 

II llllllll 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 MMMMII 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1682 


1717 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 


960 


Db 


1718 


MMMMMMMMI 1 ! 1 1 1 1 1 1 1 1 1 II 1 llllllll IMMMMMIM II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


1777 



Qy 961 GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 102 0 

II I , I II I I I I! ! I I I I I I I I llllllll IMIMII llllllll llllllll 
Db 1778 GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 1837 

Qy 1021 GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 108 0 

mmi ii 1 1 1 1 1 1 iii : ; 1 1 1 1 1 ii 1 1 1 1 1 : i ii 1 1 1 r iiiiiiuiii Mini 

Db 1838 GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 1897 

Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

MMMMMMMMMMMMIMIMM M II II M I M M M M II M M II M 

Db 18 98 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 1957 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I II M M M 1 1 1 M M 1 1 

Db 1958 AAGCCCAAGGTGGAGGAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2 014 



RESULT 11 




AAQ56923 




t 

1 U 


AAQ56923 standard; cDNA / 2537 BP. 


vv 

AA 






AL 


AAQ56923; 




Y Y 
AA 






DT 


25-MAR-2003 


(updated) 


U 1 


09-JUL-1994 


(first entry) 


vv 
AA 






JJCj 


Human pre-TGF 


-beta-1 . 


VV 

AA 






KW 


TGF-beta-1; TGF-beta-2; transforming growth factor beta-1; 


KW 


transforming growth factor beta -3; recombinant; wound heal 


KW 


vulnerary; ss 




XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qualifiers 


FT 


misc_structure 47 . . 113 


FT 




/*tag= a 


FT 




/note= "possible hairpin loop region" 


FT 


CDS 


842 . .2014 


FT 




/*tag- b 


FT 


mat_j?eptide 


1676. .2011 


FT 




/*tag= c 


FT 


polyA_signal 


2515. .2521 


FT 




/*tag= d 


XX 






PN 


US5284763-A. 




XX 






PD 


08-FEB-1994 . 




XX 






PF 


04-MAR-1992; 


92US-0845893 . 


XX 






PR 


22-MAR-1985; 


85US-0715142 . 


PR 


13-MAR-1987; 


87US-0025423 . 


PR 


04 -AUG- 198 9; 


89US-0389929. 


PR 


04-MAR-1992; 


92US-0845893. 


XX 






PA 


(GETH ) GENENTECH INC. 



XX 

PI Derynk RMA, Goeddel DV; 
XX 

DR WPI; 1994-056343/07. 

DR P-PSDB; AAR46227. 
XX 

PT Nucleic acid sequences encoding transforming growth factor-beta - 

PT diagnostic probes, and for use in therapeutics 

XX 

PS Disclosure; Fig lb; 25pp; English. 
XX 

CC cDNA sequences were determined for human pre-TGF-beta- 1 (AAQ56923) , 

CC pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 

CC corresponding amino acid sequences were determined (AAR46227-29 , 

CC respectively) . A genomic fragment corresponding to a human TGF- 

CC beta-1 exon (AAQ56924) was also isolated and its amino acid sequence 

CC determined (AAR46230) . The sequences have been used in the 

CC construction of vectors for the expression of recombinant TGF- 

CC beta . 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 2537 BP; 473 A; 890 C; 742 G; 432 T; 0 other; 



Query Match 79.8%; Score 955; DB 15; Length 2537; 

Best Local Similarity 88.4%; Pred. No. 9.3e-193; 

Matches 1058; Conservative 0; Mismatches 115; Indels 24; Gaps 1; 



Qy 


i 


ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 

III MM II IMMIMIM IMIMMIMM llllllllllllll II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 


60 


Db 


842 


901 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Db 


902 


M 1 1 1 II 1 II M 1 IMMIIM Mill II II 1 1 1 II II M 1 1 II M M 1 M 1 II II 

CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


961 


Qy 

Db 


121 

962 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

1 1 1 1 1 f 1 1 1 1 1 r f f f 1 1 1 1 j 1 1 1 1 r i iiiMiiiiii iiiiiiiiiii Mill mi 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


180 
1021 


Qy 
Db 


181 
1022 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

MIIIIIIIIIIIMMMI 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 II 1 Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


240 
1081 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

Mllllll IMIIIMMIIM Mllllll MM MM MMMMMM Ml 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACGGGAGCCCGAGCCTGAG 


300 


Db 


1082 


1141 


Qy 
Db 


301 
1142 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II i 1 1 1 1 II 1 1 1 II M 1 1 1 1 II M 1 1 M 1 1 1 1 1 1 II II M 1 1 1 M 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 
1201 


Qy 


361 


TATGATAAATT CAAGGG CAC CCCCCACAG CTTATATATG CTGTTCAACACGT CGGAG CTC 


420 


Db 


1202 


Mill M MIDI 1 1 MUM Mllllll 1 llllllll M MIMI 

TATGACAAGTTCAAGCAGAGTACACACAGCAT^ 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II llllllll II Mill II Mllllll MMMIMIMM MMMMMM 


480 



Db 


1262 


CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


1321 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 


Db 


1322 


IIIIIIIIIIIIIMIIIIMIIIIIIII IIIIIIIMIMIIIII llllllllll 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


1381 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 


600 


Db 


1382 


IIIIIIIIIIIIIIIMIIIIII Mill Mill II MUM 1 1 1 IMIMM 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


1441 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 


660 


Db 


1442 


IMIMIIIIMMMIMM III III III II M MM! IMIMM II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


1501 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 


720 


Db 


1502 


MIMM MM IMIMM IMIIMMM II Mill 1 1 MMMMII 1 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


1561 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 


780 


Db 


1562 


1 IMIMM II II M 1 1 M M 1 M M II 1 MMMMMIMI IMIMM Ml 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


1621 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 


840 


Db 


1622 


1 M M II M M II 1 M M 1 M M 1 1 M 1 1 Mill II M M II 1 II 1 M M 1 M II II 1 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 


1681 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

1 1 IMIMM MMMMMIMI MMMMII 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1682 


1717 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

MMMMIMMMM IMMIMIMIM IMIMM IMIIMIIIMM 1 1 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 


Db 


1718 


1777 


Qy 
Db 


961 
1778 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

II 1 1 II 1 1 M 1 1 11 II 1 1 ! 1 1 1 MMMM M M II II IMIMM IMIMM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 


1020 
1837 


Qy 
Db 


1021 
1838 


GACACTCAGTACAG(^GGTCCTGGCTCTGTA(^CCAGCACAACCCGGGCGCGTCGGCG 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II IMMIIMM Mill! 

GA(^CGCAGTA(^GCAAGGTCCTGGCCCTGTACAACaVGCATAACCCGGGCGCCTCGGCG 


1080 
1897 


Qy 
Db 


1081 
1898 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 

II II M II M 1 M M 1 1 M 1 II M M M M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 


1140 
1957 


Qy 


1141 


AAGCCC^GGTGGAGCAGCTGTC(^CATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II II 1 II 1 1 1 1 1 1 1 II II 1 1 MM!!!!!!!!!!!!!! 

AAGCCCAAGGTGGAG(^GCTGTC(^CATGATCGTGCGCTCCTGCAAGTGCAGCTGA 2 014 


Db 


1958 



RESULT 12 
AAN81084 

ID AAN81084 standard; cDNA; 1560 BP. 
XX 

AC AAN81084; 
XX 



DT 25-MAR-2003 (updated) 

DT 09-OCT-1990 (first entry) 

XX 

DE Coding sequence of simian transforming growth factor-beta 1. 
XX 

KW Transforming growth factor-beta 1; tumour treatment; ss cDNA. 
XX 

OS Cercopithecus aethiops . 
XX 

FH Key Location/Qualifiers 

FT CDS 261. . 1433 

FT /*tag= a 

FT sig_peptide 282.. 323 

FT /*tag= b 

FT mat_peptide 1095.. 1433 

FT /*tag= c 

XX 

PN EP293785-A. 
XX 

PD 07-DEC-1988. 
XX 

PF 27-MAY-1988; 88EP- 0108528 . 
XX 

PR 29-MAY-1987; 87US- 0055662 . 

PR 25-JAN-1988; 88US- 014 7842 . 
XX 

PA (ONCO ) ONCOGEN. 

PA (BRIM ) BRISTOL-MYERS CO. 

XX 

PI Purchio AG, Gentry L, Twardzik D; 
XX 

DR WPI; 1988-347488/49. 

DR P-PSDB; AAP80647. 
XX 

PT Prodn. of simian transforming growth factor beta-1 - by culturing 

PT transfected eucaryotic cells, and new precursor proteins, useful for 

PT treating tumours. 
XX 

PS Disclosure; Page ?; pp; English. 
XX 

CC The cDNA is prepd . from African green monkey cell line BSC-40 and is 

CC expressed in eukaryotic cells in plasmid pSV2 . There is 100% homology 

CC between mature simian and human TGF-beta 1. The plasmid also contains 

CC the SV40 promoter and a selection marker, esp. DHFR. 

CC (Updated on 25-MAR-2003 to correct PA field.) 

CC (Updated on 25-MAR-2003 to correct PI field.) 

XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 79.6%; Score 953.4; DB 9; Length 1560; 

Best Local Similarity 88.3%; Pred. No. 1.9e-192; 

Matches 1057; Conservative 0; Mismatches 116; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II I MM Mill I I Mill Mill II Mill MUM II I II III 

Db 2 61 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 



Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

MINIMI MIMMIIMM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllllll MINI 

Db 321 CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 380 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 Mill III 

Db 381 GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 44 0 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

f 1 1 1 1 1 f i r 1 1 1 1 1 1 1 1 1 1 1 i m 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 Mm ii ii ii ii 

Db 441 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 00 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

1 1 1 1 1 M I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM 1 1 1 1 II IMIMMMI M Ml 

Db 501 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 56 0 

Qy 3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 6 0 

ii 1 1 1 1 1 1 1 1 1 1 1 j i r i r 1 1 1 1 1 1 1 1 r 1 1 m 1 1 1 1 1 1 1 ( 1 1 1 1 1 1 i mm imm 

Db 561 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

Mill II MMM I I I MUM MMMM I MMMM II llllll 

Db 621 TATGAC^GTTCAAGCAGAGCACACAC^GCATATATATGTTCTTCAACACATCAGAGCTC 680 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 480 

II Mill II II MMMM MMMM IMIMMIIIIM IMIMIMIII 

Db 681 CGAGAAGCAGTACCTGAACCTGTGTTGCTCTGCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 

Qy 481 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

IIMMIMM MMMM MMMM 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 IMIMIMI 

Db 741 AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 8 00 

Qy 541 TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 6 00 

1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MM MMMM MM IMIIIIM 

Db 8 01 TACCTCAGCAACCGGCTGCTGGCGCCCAGCAACTCGCCGGAGTGGTTGTCTTTTGATGTC 8 60 

Qy 601 ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTGGCCTCAGT 6 60 

M M M I M M II M M I II I Ml MM Ml II 1 1 Mill MMMM II 

Db 861 ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 92 0 

Qy 661 GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 72 0 

llllll! MM 1 1 f f 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 i II Mill II MMIIIMI I 

Db 921 GCCCACTGCTCCTGTGACAGCAAAGATAAO^CACTGCAAGTGGACATCAACGGGTTCACT 980 

Qy 721 TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 78 0 

I llllllll MMMIMIMM Mill IMMMIMMM MMMM III 

Db 981 ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 104 0 

Qy 781 ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 84 0 

M M II I M II I M I II I M 1 1 M M II Mill 1 1 II 1 1 1 M II II II II 1 1 II M I 

Db 1041 ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 1100 

Qy 841 GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 9 00 

IN M 1 1 II 1 1 1 1 III 1 1 il II 1 1 1 IMIIIIM 

Db 1101 GACACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 1136 

Qy 901 TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 



Illllllllllllllll II lllllllllll llllllll llllllllllllll II 

TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 1196 



Db 


1137 


TGCTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


1196 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 


1020 






II 1 1 1 1 1 1 1 1 M 1 1 1 1 !l 1 II! IIIIMIIIIIIIIIIIIMMIIII llllllll 




Db 


1197 


GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 


1256 




J. \J ± 


GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 


1080 






Mill IIIIIIIIIIIMIIMIII IIIMIIIIIIIII lllllllllll Mill 




Db 


1257 


GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 


1316 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 


1140 


Db 


1317 


1 1 M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M 1 II II ! 1 1 1 M 1 1 1 1 M 1 II II 1 1 1 1 1 1 1 1 1 1 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 


1376 


Qy 


1141 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATGGTGCGTTCCTGCAAGTGCAGCTGA 1197 






IIIIIMIMIIIIIIIIIIIMMIIIIIIIIIMM Mill II lllllllll 




Db 


1377 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 1433 



RESULT 13 
AAQ03508 

ID AAQ03508 standard; DNA; 1560 BP. 
XX 

AC AAQ03508; 
XX 

DT 25-MAR-2003 (updated) 
DT 09-JAN-2003 (updated) 
DT 14-AUG-1990 (first entry) 
XX 

DE Simian Transforming growth factor - Betal. 
XX 

KW HIV; AIDS; SIV; vaccine; AZT; CD 4 ; cytokines; growth 

KW factors; ds . 

XX 

OS Cebus apella. 
XX 

FH Key Location/Qualifiers 

FT CDS 267. . 1437 

FT /*tag= a 

FT mat_peptide 1103.. 1437 

FT /*tag= b 

XX 

PN EP356935-A. 
XX 

PD 07-MAR-1990. 
XX 

PF 25-AUG-1989; 8 9EP- 0 1 1 571 9 . 
XX 

PR 25-AUG-1988; 88US- 0236698 . 
XX 

PA (ON CO ) ONCOGEN LP. 
XX 

PI Brankovan V, Lioubin M, Purchio A; 
XX 

DR WPI; 1990-068723/10. 
DR P-PSDB; AAR05663. 



XX 

PT Compsns . contg. transforming growth factor beta - 

PT used for inhibitions of HIV infection and replication in vivo. 

XX 

PS Disclosure; Fig 1; 20pp; English. 
XX 

CC TGF-beta may be used in vivo to prevent formation of synctia and 

CC inhibit HIV infection. TGF may also be used with other HIV treatments 

CC (AZT, soluble CD4 etc.). 

CC (Updated on 09-JAN-2003 to add missing OS field.) 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 1560 BP; 301 A; 547 C; 445 G; 267 T; 0 other; 

Query Match 79.6%; Score 953.4; DB 11; Length 1560; 

Best Local Similarity 88.3%; Pred. No. 1.9e-192; 

Matches 1057; Conservative 0; Mismatches 116; Indels 24; Gaps 1; 

ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III MM M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMMIMI II Ml 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 32 0 
CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

1 1 1 ! 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 Mill I! II 1 1 1 1 1 1 1 1 1 1 II M II M llllll 

CTGACGCCTAGCCGGCCGGCCGCAGGACTATCCACCTGCAAGACTATCGACATCGAGCTG 38 0 
GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM II II II II II I IMMMMM Mill Ml 

GTGAAGCGGAAGCGCATCGAGACCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 4 4 0 
AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

IIIIIIIIIIIMIIIMM 1, 1 1 1 ! M M 1 1 M I M 1 1 1 Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 5 00 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

1 1 1 1 1 1 1 1 I II M I II II II II 'ML! MM II IMMMMM M III 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCGGAGCCGGAGCCCGAACCGGAG 560 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

II II II I II 1 1 II I II I M M 1 1 II 1 1 II II II I M I M M 1 1 II I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 62 0 
TATGATAAATTC^GGGCACCCCCCAC^GCTTATATATGCTGTTO^CACGTCGGAGCTC 42 0 

Mill II MUM I I I MUM MIIIMI I IMIIMI II llllll 

TATGACAAGTTCAAGCAGAGCACACA 68 0 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

II Mill II II MIIIMI MIIIMI MMMMMIMI MM MM! Ml 

CGAGAAGCAGTACCTGAACCTGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 74 0 
AAGTTAA^GTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

lllllllllll MIIIMI llllll IMIMMMIMIIM IMIMIIII 

AAGTTAAAAGTCGAGCAGCATGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 8 00 



Qy 


1 


Db 


261 


Qy 


61 


Db 


321 


Qy 


121 


Db 


381 


Qy 


181 


Db 


441 


Qy 


241 


Db 


501 


Qy 


301 


Db 


561 


Qy 


361 


Db 


621 


Qy 


421 


Db 


681 


Qy 


481 


Db 


741 


Qy 


541 


Db 


801 



Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

IIIMIIIIIIIIIIMIMI III MM Ml II II Mill MMMM II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGCGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


660 


Db 


861 


920 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

lllllll MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Mill 1 1 MMMM 1 I 

GCCCACTGCTCCTGTGACAGCAAAGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


720 


Db 


921 


980 


Qy 
Db 


721 
981 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 MMMM MMMMMMM MM M MMMMMMM MMMM Ml 

ACCGGCCGCCGAGGTGACCTGGCCACAATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


780 
1040 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

1 f I J 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II Mill IIIIIMIIMIIIIIIIIIMII 

ATGGCCACCCCGCTGGAGAGGGCCCAACATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 


840 


Db 


1041 


1100 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

II 1 1 ! 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 MMIIIMI 

GACACCAACTACTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1101 


1136 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

MIIMIIIMMIIII M MMMM MMMM MMMMMMM M 

TGGTGCGTGCGGCAGCTGTATATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 


Db 


1137 


1196 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

M 1 J 1 1 1 1 1 1 J I j 1 1 r f i r 1 1 1 IIMMIIIIIIIIIIIIIIIIMII MMMM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTGGGGCCCTGTCCCTACATTTGGAGCCTG 


1020 


Db 


1197 


1256 


Qy 


1021 


GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 

Mill MMMMIMMMIMM MMMMMMM' IMIMIMM MIMI 

GACAGGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 


1080 


Db 


1257 


1316 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 

1 1 1 M 1 1 II 1 1 1 1 1 M 1 M 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 


1140 


Db 


1317 


1376 


Qy 


1141 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 II II 1 1 II 1 1 II 1 1 M 1 II II 1 1 1 1 M M 1 1 1 1 1 1 1 Mill II MMMM 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGAAAATGCAGCTGA 14 33 


Db 


1377 



RESULT 14 




AAV52933 




ID 


AAV52933 standard; cDNA; 2537 BP. 




XX 






AC 


AAV52933; 




XX 






DT 


25-MAR-2 003 (updated) 




DT 


21-DEC-1998 (first entry) 




XX 






DE 


Human pre-transforming growth factor-beta 1 


CDNA. 


XX 






KW 


Transforming growth factor-beta 1; TGF-beta 


1; human; ss 


XX 






OS 


Homo sapiens. 




XX 







FH Key Location/Qualifiers 

FT CDS 842 . .2014 

FT /*tag= a 

FT mat_peptide 1676.. 2011 

FT /*tag= b 

FT stem_loop 37 . . 113 

FT " /*tag= b 

FT /note= "putative stable hairpin loop" 

FT misc_feature 2015.. 2100 

FT /*tag= c 

FT /note= "GC-rich sequence" 

FT polyA_signal 2514.. 2520 

FT /*tag= d 

XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998. 
XX 

PF 30-MAY-1995; 95US- 0454468 . 
XX 

PR 13-MAR-1987; 87US- 002 5423 . 

PR 22-MAR-1985; 85US- 0715142 . 

PR 04-AUG-1989; 89US-0389929 . 

PR 04-MAR-1992; 92US-0845893 . 

PR 05-NOV-1993; 93US-0147364 . 

PR 30-MAY-1995; 95US-0454468 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA , Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR P-PSDB; AAW78785. 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 2 6pp; English. 
XX 

CC This nucleotide sequence codes for the human transforming growth 

CC factor-beta 1 precursor {preTGF-beta 1, see AAW78785) . It is a 

CC composite of overlapping cDNA clones isolated from different cDNA 

CC libraries (placenta, A172 glioblastoma, HT1080 f ibroblastoma) using 

CC TGF-beta exon (see AAV52936) restriction fragments as probes. 

CC The 3' region of the sequence was determined using cloned genomic 

CC DNA. The invention relates to the recombinant production of 

CC TGF-beta. Biologically active TGF-beta is defined as being capable 

CC of inducing EGF-potent iated anchorage independent growth of target 

CC cell lines and/or growth inhibition of neoplastic cell lines. 

CC Nucleic acids encoding TGF-beta have been isolated and cloned into 

CC vectors which are replicated in bacteria and expressed in 

CC eukaryotic cells. TGF-beta recovered from transformed cells is 

CC used in known therapeutic applications. TGF-beta nucleic acids are 

CC also useful in diagnosis and identification of TGF-beta clones. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 



SQ Sequence 2537 BP; 475 A; 895 C; 736 G; 431 T; 0 other; 



Query Match 79.5%; Score 951.8; DB 19; Length 2537; 

Best Local Similarity 88.2%; Pred. No. 4.4e-192 ; 

Matches 1056; Conservative 0; Mismatches 117; Indels 24; Gaps 1; 

ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM M MIMMIMI lllllllllllll I i 1 1 1 1 1 1 1 1 1 1 1 1 II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 
CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

iiiiiiiiiiiii iiiiiii i Mill Miiiiiiiiiiii 1 1 1 r i f 1 1 r r i r r i 

CTGACGCCTGGCCCGCCGGCCCCGGGACTATCCACCTGCAAGACTATCGACATGGAGCAG 961 



Qy 


l 


Db 


842 


Qy 


61 


Db 


902 


Qy 


121 


Db 


962 


Qy 


181 


Db 


1022 


Qy 


241 


Db 


1082 


Qy 


301 


Db 


1142 


Qy 


361 


Db 


1202 


Qy 


421 


Db 


1262 


Qy 


481 


Db 


1322 


Qy 


541 


Db 


1382 


Qy 


601 


Db 


1442 


Qy 


661 


Db 


1502 


Qy 


721 


Db 


1562 



AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

MMMMIMMIMIMI " 1 1, ;,| || 1 1 1 1 1 1 1 1 Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 1081 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

II M 1 1 II llllllllllllll IIIIIMI I II I 1 1 1 1 r 1 1 1 i I f 1 1 1 1 1 1 Ml 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 1141 
GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 60 

M 1 1 1 1 1 1 1 11 1 II I M M 1 1 1 M 1 1 1 II M 1 1 1 1 II II I II I II I MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 12 01 
TATGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

Mill II MIMI I I MMM IIIIIMI I IIIIIMI II MIMI 

TATGACAAGTTCAAGCAGAGTACACACAGCATAT^ 1261 
CGGG AAG CGGTG C CGGAAC CTGTATTG CTCTCT CGGG CAGAG CTG CG CCTG CTGAGG CTC 4 80 

II IIIIIMI M Mill II IIIIIMI llllllllllllll MINIMUM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 1321 
AAGTTAAAAGTGG AG CAG CA CG TGGAG CTATA CCAGAAATA CAG CAATGATT CCTGG CG C 54 0 

IIIIIMIIIIIMIMIIIIIIIMIM MMMIMMMIMI . ; 1 1 1 1 1 1 1 : 

AAGTTAAAAGTGGAGC^GCACGTGGAGCTGTACCAGAAATACAGO^CAATTCCTGGCGA 1381 
TAC CTCAG CAAC CGG CTG CTGG C C CC CAGTGACTCAC CGGAGTGG CTGTCCTTTGATGT C 600 

i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 Mill Mill M MIMI I II MINIMI 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 1441 



1 ) 1 1 1 i 1 1 1 1 1 1 1 i 1 1 1 1 M 1 III III IN I II Mill MM!!!! !! 



GCCC^CTCTTCCTCTGA^GOU^GATAAC^C^CTCCACGTGGAAATTAACGGGTTa^AT 72 0 

IIIIIII MM IIIIIMI NIMININ II lllll II MINIUM I 

GCCC^CTGCTCCTGTGACAGC^GGGATAAC^CACTGCAAGTGGAaVTCAACGGGTTCACT 1561 



I MII'll, I < I i 1 1 1 M.M 1 1 1 , 1 1 1 1 INNNMINN IIIIIMI II 



Qy 


781 


Db 


1622 


Qy 


841 


Db 


1682 


Qy 


901 


Db 


1718 


Qy 


961 


Db 


1778 


Qy 


1021 


Db 


1838 


Qy 


1081 


Db 


1898 


Qy 


1141 


Db 


1958 



I MM MM 1 1 1 1 II I II 1 1 1 Ml 1 1 1 ll Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 
GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

II MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MIMMIII 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

I I I; 1 1 II 1 1 1 1 1 II I i II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II M I M I M I II 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 
GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

II I M 1 1 II 1 1 II M M I M II IMIMII IMIIMI MMMM 1 1 1 f 1 1 1 1 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCG 

Mill 1 1 1 1 1 1 i 1 1 1 , 1 1 , ! I IMIIIIIIIIIII 1 1 1 1 1 1 1 1 1 1 1 MUM 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 If 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 Ml I M II 1 1 1 1 1 1 1 1 II 1 1 1 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 
AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1 1 

M M I MM 1 1 1 1 1 1 1 1 1 1 II I II I Ml 1 1 1 1 1 1 Ml , 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 



RESULT 15 
ABV75391 

ID ABV75391 standard; DNA; 1176 BP. 
XX 

AC ABV7 53 91; 
XX 

DT 18-MAR-2003 (first entry) 
XX 

DE TGFB1 Arg25Pro polymorphism G-allele nucleotide sequence. 
XX 

KW Polymorphism; transforming growth factor beta 1; TGFB1; TGFbeta-1; 

KW renal failure; nephrotropic; human; gene; ds . 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 1 . . 1176 

FT /*tag= a 

XX 

PN WO200290585-A2 . 
XX 

PD 14-NOV-2002. 
XX 

PF 08-MAY-2002; 2 0 02WO-GB02 066 . 
XX 

PR 09-MAY-2001; 2 001GB- 0011277 . 
XX 



PA (UYSH-) UNIV SHEFFIELD HALLAM . 
XX 

PI El-Nahas AM, Blakemore A, Khalil MS; 
XX 

DR WPI; 2003-120560/11. 

DR P-PSDB; ABB82780. 
XX 

PT Determining an individual's susceptibility to the progression of renal 

PT failure comprises detecting the presence of a genetic polymorphism 

PT pattern in transforming growth factor beta 1 (TGFB1) gene in a sample 

PT from the individual 
XX 

PS Claim 49; Page 58; 62pp; English. 
XX 

CC The invention relates to determining an individual's susceptibility to 

CC renal failure and invovles detecting the presence of a genetic 

CC polymorphism pattern in transforming growth factor beta 1 (TGFB1) gene in 

CC a sample from an individual, where polymorphism pattern is associated 

CC with renal failure. The method is useful for determining an individual's 

CC susceptibility to the progression of renal failure. The nucleic acid 

CC comprising a T(-509)C polymorphism of TGFB1 gene, or a polypeptide 

CC comprising a sequence of 391 amino acids is useful for preparing a 

CC medicament for retarding or preventing the progression of renal disease, 

CC and for drug research purposes for retarding or preventing the 

CC progression of renal disease. Sequences ABV75386-88 represents the 

CC nucleotide sequence for the TGFB1 G-allele of the Arg25Pro polymorphism 

CC of exon 1. 

XX 

SQ Sequence 1176 BP; 239 A; 381 C; 354 G; 202 T; 0 other; 



Query Match 79-0%; Score 945.2; DB 25; Length 1176; 

Best Local Similarity 88.3%; Pred. No. 9.8e-191; 

Matches 1060; Conservative 0; Mismatches 113; Indels 27; Gaps 2; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III I I I I II 1,11,: IIMIIIIIIIII llllllllllllll II III 
Db 1 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 60 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

li II 1 1 1 III 1 1 1 ! 1 1 1 1 1 II I Mill IIMIIIIIIIIII 1 1 1 1 1 1 1 '1 1 1 1 1 II 

Db 61 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 12 0 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

i 1 I I , i . 1 1 lil IIIIIIIMII Mill III 

Db 121 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 18 0 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 h 1 1 1 I II M; II 1 , 1 1 1 1 1 Mill II II M M 

Db 181 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 24 0 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

IMMMI IIMIIIIIIIIII 1 1 1 1 1 f 1 1 MM 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 Ml 

Db 241 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 3 00 

Qy 3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 60 

1 1 M M M I M M M M M M M I M 1 1 M M M M M M II M I I 1 1 1 1 MMI 

Db 3 01 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 360 



Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II llllll 1 1 llllll llllllll 1 llllllll II llllll 
TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


420 


Db 


361 


420 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT- - -GAGG 

II llllllll II IIMI M llllllll llllllllllllll Mill MM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 


477 


Db 


421 


480 


Qy 


478 


CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 


537 


Db 


481 


1 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 M i ! M ! 1 1 1 1 1 i 1 1 i M ! 1 1 1 1 1 ! | IIMI! MMMM 

CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 


540 


Qy 


538 


CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 

II IMIIMIIIIIIMIIIIIIII Mill Mill II MUM 1 II llllll 

CGATACCTCAGCAACCGGCTGCTGGCAGGCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 


597 


Db 


541 


600 


Qy 


598 


GTCACCGGAGTTGTGCGGCAGTGGGTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 

IMIIIIIMIIIIIIIIIIIIII III III III M II Mill llllllll 

GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 


657 


Db 


601 


660 


Qy 


658 


AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 

M lllllll MM llllllll MMIIIMM M Mill II lllllllll 

AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 


717 


Db 


661 


720 


Qy 


718 


AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGGCCTTCGTGCTC 

1 1 1 llllllll 1 1 1 1 1 1 1 r 1 1 1 1 ! 1 1 1 1 1 1 1 llllllllllllll MMMM 

ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 


777 


Db 


721 


780 


Qy 


778 


CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 

1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 ! i , < Mill MMIIMIIIIMIIIIMI 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCC 


837 


Db 


781 


840 


Qy 


838 


CTGGATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAG 

Mill MMMM llllllllllllll lllllll 

CTGGACACCAACTATTGCTTCAGCTCCAC GGAGAAG 


897 


Db 


841 


876 


Qy 


898 


AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 

IMIIIMIIIIIIIIIMI MIMMMMIM MMMM IMIMMMIIM 

AACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATC 


957 


Db 


877 


936 


Qy 


958 


(^TGAACC(^GGGCTACC^TGC(^ATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 

II M MIMIMMMIMIMM llllllll MMMM MMMM llllll 

CACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGC 


1017 


Db 


937 


996 


Qy 


1018 


CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 
II 1 1 II 1 llilllllllllllllllll llllllllllllll lllllllllll III 

! . 1 II II MIMIMMMIMIMM 1 II M II II M II 1 MMIIIMM III 

CTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCG 


1077 


Db 


997 


1056 


Qy 


1078 


GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 

1 M 1 M II 1 M 1 1 1 1 1 II 1 M II II 1 II II 1 M 1 1 : 1 1 i 1 ! I II 1 llllll 1 ! 1 1 1 ! 1 

GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGC 


1137 


Db 


1057 


1116 


Qy 


1138 


CGCAAGCCC^GGTGGAG(^GCTGTC(^(^TGATCGTGCGTTCCTGa^GTGCAGCTGA 

1 M 1 1 1 M 1 M 1 II 1 M 1 1 1 1 II 1 1 1 M 1 1 II II 1 1 II M 1 MIIMMMIMIIMI 

CG(^GCC(^AGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 


1197 


Db 


1117 


1176 



Search completed: October 27, 2003, 19:11:02 
Job time : 367.617 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2 0 03 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: October 27, 2003, 18:35:27 ; Search time 355.784 Seconds 

(without alignments) 
9022.658 Million cell updates/sec 

Title: US-10-017-372E-12 
Perfect score: 1197 

Sequence: 1 atggcgccttcggggctgcg gttcctgcaagtgcagctga 1197 



Scoring table: 
Searched: 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

1792395 seqs, 1340900451 residues 



Total number of hits satisfying chosen parameters: 3584790 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA: * 

1 : /cgn2_6/ptodata/2/pubpna/US07_PUBCOMB . seq : * 

2 : /cgn2_6/ptodata/2/pubpna/PCT__NEW_PUB. seq: * 

3 : /cgn2_6/ptodata/2/pubpna/US06_NEW_PUB. seq: * 

4 : /cgn2_6/ptodata/2/pubpna/US06_PUBCOMB . seq: * 

5 : /cgn2_6/ptodata/2/pubpna/US07_NEW_PUB . seq : * 

6 : /cgn2_6/ptodata/2/pubpna/PCTUS_PUBCOMB . seq : * 

7 : /cgn2_6/ptodata/2/pubpna/US08_NEW_PUB . seq : * 

8 : /cgn2_6/ptodata/2/pubpna/US08_PUBCOMB. seq: * 

9 : /cgn2_6/ptodata/2/pubpna/US09A_PUBCOMB. seq: * 
10: / cgn2_6 /pt oda t a / 2 /pubpna /US 0 9B_PUBCOMB . seq : * 
11 : /cgn2_6/ptodata/2/pubpna/US09C_PUBCOMB . seq : * 
12 : /cgn2_6/ptodata/2/pubpna/US09_NEW_PUB.seq:* 
13: / cgn2_6 /p t oda t a / 2 /pubpna /US 1 0A_PUBCOMB .seq:* 
14 : /cgn2_6/ptodata/2/pubpna/US10B_PUBCOMB.seq: * 
15 : /cgn2_6/ptodata/2/pubpna/US10_NEW_PUB. seq: * 
16 : /cgn2_6/ptodata/2/pubpna/US60_NEW_PUB . seq : * 
17 : /cgn2_6/ptodata/2/pubpna/US60_PUBCOMB. seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 
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ALIGNMENTS 



RESULT 1 
US-10-087-268-1 

; Sequence 1, Application US/10087268 
; Publication No. US20030119010A1 



GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER : US/10/087,268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 1 
LENGTH: 1821 
TYPE: DNA 
ORGANISM: Human 
FEATURE : 

NAME /KEY: 5 ' UTR 
LOCATION: (1) . . (511) 
OTHER INFORMATION: 
NAME /KEY: CDS 
LOCATION: (512) . . (1684) 
OTHER INFORMATION: 
NAME/ KEY: sig_peptide 
LOCATION: (512) . . (598) 
OTHER INFORMATION: 
NAME /KEY: 3 ' UTR 
LOCATION: (1685) . . (1821) 
OTHER INFORMATION: 
US-10-087-268-1 

Query Match 80.1%; Score 958.2; DB 14; Length 1821; 

Best Local Similarity 88.6%; Pred. No. 1.5e-257; 

Matches 1060; Conservative 0; Mismatches 113; Indels 24; Gaps 1; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III I I I I II lllllllllll IIIIIIIIIMM llllllllllllll II III 
Db 512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

Illllllllllllllllllllll Mill llllllllllllll I'M,,' 
Db 572 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

IMIIIIIIMIMIIMIIIIMII lllllllllll M M I II II M IMM Ml 

Db 632 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 691 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

MIIIIIIIMIIIIIIIII MMMMIMMMIMM Mill II II II II 

Db 692 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 751 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 0 0 

llllllll 1 1 1 1 1 J r 1 1 f 1 1 1 f II M I M I MM 1 1 1 1 1! I M 1 1 1 1 1 1 1 1 III 

Db 752 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 811 

Qy 301 GCGGACTACTACGCCAAGGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 3 60 

M M M 1 1 1 II M M 1 1 M II M M M I M M 1 1 M 1 1 II M M I I MM Mill 

Db 812 GCCGACTACTACGCC^AGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 871 



Qy 


361 


TATGATAAATTCAAGGG(^CCCCCCACAGCTTATATATGCTGTTO^CACGTCGGAGCTC 

Mill II llllll 1 1 llllll llllllll 1 llllllll II llllll 
TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


420 


Db 


872 


931 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II llllllll II Mill II MMMII IMMMMMMI 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


932 


991 


Qy 


481 


AAGTTAAAAGTGGAGCAGC^CGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 


540 


Db 


992 


IMIMMMMMIMMMMMMM IMMMMMMMM 1 II 1 II M II 

AAGTTAAAAGTGGAGC^GCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


1051 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

Illllllllllllllllllllll Mill Mill II llllll 1 M lllllllll 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


600 


Db 


1052 


1111 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

1 1 II 1 1 1 1 M II II 1 1 1 II II III III Ml II II Mill MMMII II 
ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


660 


Db 


1112 


1171 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

[llllll MM MMMII MIMMMM M Mill 1 1 MMMMM 1 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


720 


Db 


1172 


1231 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 MMMII MMMMMMIMIMM MMIMMIMM MMMII Ml 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


780 


Db 


1232 


1291 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

1 1 1 M 1 II 1 M 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 Mill IMIIIIIMIIIMIIIIIIMI 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTG 


840 


Db 


1292 


1351 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

II MMMII llllllllllllll MMMMM 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1352 


1387 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

MIIIIIIIIIIMIII llllllllllllll MMMII IIIIIIIIIMIII II 

TG CTGCGTG CGG CAG CTGTACATTGACTT C CG CAAGGAC CTCGG CTGGAAGTGGAT C CAC 


960 


Db 


1388 


1447 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

II IIIIIIIIIIIIIIMMM llllllll MMMII MMMII llllllll 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 


1020 


Db 


1448 


1507 


Qy 


1021 


GAC^CTCAGTACAGCAAGGTCCTGGCTCTGTACAACC^GCAC^ACCCGGGCGCGTCGGCG 

Mill IIMMIMIMMMIMI llllllllllllll MIMMMM MUM 

GACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCGGCG 


1080 


Db 


-L \J o 


1567 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 

1 M 1 II 1 II II 1 1 II M 1 1 M 1 M 1 M 1 M II i , 1 1 1 1 

GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGC 


1140 


Db 


1568 


1627 


Qy 


1141 


AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 M 1 1 1 1 1 1 II 1 1 1 1 II II M II 1 1 M 1 1 1 1 1 1 1 1 1 1 IMIIIIIIIIMIIII 

AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1684 


Db 


1628 



RESULT 2 

US-10-037-270-220 

Sequence 220, Application US/10037270 
Publication No. US20030104529A1 
GENERAL INFORMATION : 
APPLICANT: Tang, Y. Tom 
APPLICANT: Liu, Chenghua 
APPLICANT: Asundi, Vinod 
APPLICANT: Zhang, Jie 
APPLICANT: Ren, Feiyan 
APPLICANT: Chen, Rui-hong 
APPLICANT: Zhao, Qing A. 
APPLICANT: Wehrman, Tom 
APPLICANT: Xue, Aidong J . 
APPLICANT: Yang, Yonghong 
APPLICANT: Wang, Jian-Rui 
APPLICANT: Zhou, Ping 
APPLICANT: Ma, Yunqing 
APPLICANT: Wang, Dunrui 
APPLICANT: Wang, Zhiwei 
APPLICANT: Tillinghast , John 
APPLICANT: Drmanac, Radoje T. 

TITLE OF INVENTION : No. US2 0 03 01 04 52 9Alel Nucleic Acids and 
TITLE OF INVENTION: Polypeptides 
FILE REFERENCE: 784CIP2B 

CURRENT APPLICATION NUMBER: US/10/037,270 
CURRENT FILING DATE: 2002-01-04 
PRIOR APPLICATION NUMBER: 09/552,317 
PRIOR FILING DATE: 2000-04-25 
PRIOR APPLICATION NUMBER: 09/488,725 
PRIOR FILING DATE: 2000-01-21 
NUMBER OF SEQ ID NOS : 1104 
SOFTWARE: pt_FL_genes Version 1.0 
SEQ ID NO 22 0 
LENGTH: 2 742 
TYPE : DNA 

Homo sapiens 



ORGANISM: 
FEATURE: 
NAME /KEY: 
LOCATION: 



CDS 
(842) . 



(2014) 



US-10-037-270-220 



Query Match 80.1%; Score 958.2; DB 14; Length 2742; 

Best Local Similarity 88.6%; Pred. No. 1.7e-257; 

Matches 1060; Conservative 0; Mismatches 113; Indels 24; Gaps 



1; 



Qy 
Db 

Qy 

Db 

Qy 

Db 



1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III 1 1 1 1 II IMIIIMIII 1 1 1 1 1 1 M 1 11 M llllllllllllll II IN 

842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

MIM I M Ml Mill 1 1 Ml M MMI IMIIIIIIIIIII MM M MM 1 1 1 1 II 

9 02 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 

121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 180 

MM I M MM M M , MM MM i IIIIIIIIIII IIIIIIIIIII Mill III 

962 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 1021 



Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

1,11111 , 1 '1 1 M.llill IMIIII II Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


240 


Db 


1022 


1081 


Qy 


241 


TACAACAGTACCCGCGAGCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

lllllill IIIIIIIIMIMI lllllill MM mm II II 1 III 
TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


300 


Db 


1082 


1141 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGGTAATGGTGGAAAGCGGCAACCAAATC 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


1142 


1201 


Qy 


361 


TATGATAAATTCAAGGGCACCCGCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill II llllll 1 1 IMMI lllllill 1 lllllill II IIMM 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


420 


Db 


1202 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II lllllill II Mill II lllllill llllllllllllll MINIMUM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


1262 


1321 


Qy 


481 


AAGTTAAAAGTGG AG CAG CA CGTGGAG CT ATAG CAGAAATACAG CAATG ATTC CTGG CG C 

1 1 1 1 1 f f 1 1 1 1 1 1 r i r 1 1 f j i r 1 1 1 1 1 1 1 immmiiimi imiiimm 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


540 


Db 


1322 


1381 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGAGTCACCGGAGTGGCTGTCCTTTGATGTC 

IIIIIIIIIIIIIIIMIIIMI Mill Mill II llllll 1 II III 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


600 


Db 


1382 


1441 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

MMMIIMMIIMIMM III III III M II Mill lllllill II 

ACCGGAGTTGTGGGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


660 


Db 


1442 


1501 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

IMIIII MM lllllill MIMIIMM II Mill II Illlllllll 1 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCAGT 


720 


Db 


1502 


1561 


Qy 


721 


TCTGGCGGCGGGGGTGACCTGGGCACCATTCACGGCATGAACCGGCCCTTCCTGCTGCTC 

1 lllllill 1 1 1 1 1 1 1 1 1 1 1 M 1 I 1 1 'I , llllllllllllll MMMM Ml 
ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGGCTTTCCTGCTTCTC 


780 


Db 


1562 


1621 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTGCCGGCACCGCCGAGCCCTG 

II II 1 1 II 1 1 II II II II II II II 1 II II Mill MIIMIIIIIIIIIIMIIIIII 

ATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTGCCGGCACGGCCGAGCCCTG 


840 


Db 


1622 


1681 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t I I I l l I I I I 

I I II II 1 1 II 1 M M 1 M II M 1 1 M 1 1 1 II 1 1 1 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1682 


1717 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

IMMIMIIMIMM 1 1 1 1 J 1 1 J f r 1 1 1 1 MMMM IMIMIIIIMM M 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 


Db 


1718 


1777 


Qy 


961 


GAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTA 

M IMIIIIIIIIIIIIIIIII MMMM MMMM lllllill MMMM 

GAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTG 


1020 


Db 


1778 


1837 



Qy 


1021 


Dh 

YJVJ 


1 QOQ 

1 o J o 


Qy 


1081 


Db 


1898 


Qy 


1141 


Db 


1958 



Mill llllllilllllllllllll llllllllllllll lllllllllll Mill 

GACACGCAGTA(^GCA^GGTCCTGGCCCTGTAC^ACCAGCATAACCCGGGCGCCTCGGC( 
GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCG( 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



RESULT 3 
US-10-087-268-4 

; Sequence 4, Application US/10087268 

; Publication No. US2003 0119010A1 

; GENERAL INFORMATION: 

; APPLICANT: Jonsonn, Julie Ruth 

; APPLICANT: Powell, Elizabeth Ellen 

; TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

; FILE REFERENCE: Fibrosis 

; CURRENT APPLICATION NUMBER: US/10/087 , 268 

CURRENT FILING DATE: 2002-03-01 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 4 

LENGTH: 1821 

TYPE: DNA 

ORGANISM: Human 

FEATURE : 

NAME /KEY: 5 » UTR 

LOCATION: (1) . . (511) 

OTHER INFORMATION: 

NAME /KEY: CDS 

LOCATION: (512) . . (1684) 

OTHER INFORMATION: 

NAME/KEY: sig_peptide 

LOCATION: (512) . . (598) 

OTHER INFORMATION: 

NAME /KEY: 3 • UTR 

LOCATION: (1685) . . (1821) 

OTHER INFORMATION: 
US-10-087-268-4 

Query Match 79.9%; Score 956.6; DB 14; Length 1821; 

Best Local Similarity 88.5%; Pred. No. 4.3e-257; 

Matches 1059; Conservative 0; Mismatches 114; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

ill 1 1 1 1 II lllllllllll lllllllllllll llllllllllllll ll III 

Db 512 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 571 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 120 

1 1 1 1 1 1 M 1 1 f 1 1 1 1 1 1 1 1 1 1 1 Mill I f I f 1 1 1 1 1 1 1 1 1 1 

Db 572 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 631 



Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

IMIIIIIIIIIIIIIIIIMIIIII IIIIIIIIMI IIIIIIIMII Mill III 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


180 


Db 


632 


691 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 


240 


Db 


692 


IIIIMIIMIIIIIMIM MMMIIIIIMIIIIMI Mill M M M M 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCGGAGGCCGTGCTCGCGCTG 


751 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

llllllll MIMMMMMI llllllll MM 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 Ml 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


300 


Db 


752 


811 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

M MMIIIIMIIMMMMMIIIMMIIIMIIIIIMM 1 MM Mill 

GGCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


812 


871 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill M II M II 1 1 MUM MMMM 1 MMMM M IMMI 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


420 


Db 


872 


931 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGGCTGCTGAGGCTC 


480 


Db 


932 


II MMMM II Mill II MMMM IMMIIIMMM MIMMIMM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


991 


Qy 


481 


AAGTTAAAAGTGGAGCAGCAGGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

IIIIIMIIIIIIIMIIIMIIIIIMI IMIMMIIMMIM Illllillll 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


540 


Db 


992 


1051 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

MMMMIMMMMMIMI Mill Mill II IMMI 1 II IMIIMM 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


600 


Db 


1052 


1111 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

II 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 1 III III III M II Mill MMMM II 

AGCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


660 


Db 


1112 


1171 


Qy 


661 


GGCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 


720 


Db 


1172 


lllllll MM MMMM MMMIMM II Mill M Illllillll 1 

GCCCACTGCTGCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


1231 


Qy 


721 


TCTGGCCGCGGGGGTGACCTGGCCACCATTCAGGGCATGAAGCGGCCCTTCCTGCTCCTC 

1 MMMM IIMIIIIIIIIIIIIIIII llllllllllllll MMMM Ml 

ACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTC 


780 


Db 


1232 


1291 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 II Mill II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


840 


Db 


1292 


1 II 1 M 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 II II 1 1 Mill 1 1 II 1 II 1 1 II II II II 1 1 1 II II 

ATGGCCACCCCGCTGGAGAGGGCGCAGCATCTGCAAAGGTCCCGGCACCGCCGAGCGCTG 


1351 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

II MMMM llllllllllllll Illllillll 

GACACCAACTATTGCTTCAGCTCCAC GGAGAAGAAC 


900 


Db 


1352 


1387 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

MMMIMM, MM, MIMMMMMI MMMM llllllllllllll M 

TGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCAC 


960 


Db 


1388 


1447 



Qy 


961 


Db 


1448 


Qy 


1021 


Db 


1508 


Qy 


1081 


Db 


1568 


Qy 


1141 


Db 


1628 



II IIIIIIIMIIIIIIIIIII III INI MINIM MMMM MMMM 

GAG C CCAAGGG CTAC CATG C CAA CTT CTG C CT CGGG CCCTG CCC CTACATTTGGAGCCTC 
GACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAAGCCGGGCGCGTCGGC( 

MIM MMMMMMMMMM MMMMMMM I ! 1 1 1 1 1 1 1 f I MMI 



MMMMMMMMMMMMMMMMMMM MMMMMMMMI 



RESULT 4 

US-09-948-002-28 

; Sequence 28, Application US/09948002 

; Publication No. US20030050265A1 

; GENERAL INFORMATION: 

; APPLICANT: Nicholas M . Dean 

; APPLICANT: Susan F. Murray 

; TITLE OF INVENTION : ANTI SENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

; CURRENT APPLICATION NUMBER: US/ 0 9 / 94 8 , 0 02 

; CURRENT FILING DATE: 2000-09-05 

; PRIOR APPLICATION NUMBER: 09/661,753 

; PRIOR FILING DATE: 2000-09-14 

; PRIOR APPLICATION NUMBER: 60/154,546 

/ PRIOR FILING DATE: 1999-09-17 

; NUMBER OF SEQ ID NOS : 71 

; SEQ ID NO 28 

LENGTH: 274 5 

TYPE: DNA 
; ORGANISM: Homo sapiens 

FEATURE : 

NAME /KEY: CDS 

LOCATION: (842) . . . (2017) 
US-09-948-002-28 

Query Match 78.8%; Score 943.6; DB 11; Length 2745; 

Best Local Similarity 88.2%; Pred. No. 2.1e-253; 

Matches 1059; Conservative 0; Mismatches 114; Indels 27; Gaps 2; 

Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

Ml MM II lllllllllll IMIIIMIMM llllllllllllll II III 
Db 842 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 901 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

IMMMMMM MINIMI IMM IMIIIININN 1 1 1 II 1 1 1 1 i II 1 1 1 

Db 902 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 961 



Qy 



121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

II I II 1 1 1 1 1 1 II 1 1 1 1 1 II M II 1 1 lllllllllll Mill III 



Db 


962 


GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


1021 


Qy 


181 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 


240 


Db 


1022 


IliiliM IhM Mill II M II II M II M 1 1 M M Mill II II II II 

AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


1081 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


300 


Db 


1082 


MINIM MMMMMMM MMMM MM 1 M II M M II M II 1 1 Ml 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


1141 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


360 


Db 


1142 


M IMMMMMMMMMMIMMMMMIMMIMMI 1 MM Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


1201 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 


Db 


1202 


Mill II MMM 1 1 1 M 1 II MMMM 1 MMMM II MIMI 

TATGACAAGTTC^GCAGAGTAC^CAC^GCATATATATGTTCTTC^ACACATCAGAGCTC 


1261 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT GAGG 


477 


Db 


1262 


M MMMM II Mill M MMMM MMMMMMM IMM MM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 


1321 


Qy 


478 


CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 

IMMIMMMMMMMMMIMMMI II 1 1 1 M 1 1 1 1 1 1 M 1 1 MMMM 

CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGG 


537 


Db 


1322 


1381 


Qy 


538 


CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 


597 


Db 


1382 


M MMMMMMIMMMMM Mill Mill M MIMI 1 II MINI 

CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 


1441 


Qy 


598 


GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 

MIIIMMMMMMIIIIMI III III III II II Mill MINIM 

GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 


657 


Db 


1442 


1501 


Qy 


658 


AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAAGGGGTTC 

II 1 'ill UN MMMM INININN II INN II MINIM! 

AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 


717 


Db 


1502 


1561 


Qy 

Db 


718 
1562 


AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 

1 1 1 llllllll 1 1 1 1 1 1 1 1 1 1 1 1 1 !i Ml II N N II 1 II II 1 1 1 llllllll 

ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 


777 
1621 


Qy 
Db 


778 
1622 


CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 

1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 INN M 1 1 1, 1 1 1 1 1 1 II 1 Ml 1 II 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCG 


837 
1681 


Qy 


838 


CTGGATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAG 
INN llllllll NNNININN Illllll 


897 


UD 


i con 


1717 


Qy 
Db 


898 
1718 


AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 

II 1 II 1 N II N 1 1 1 II 1 N NNINNINN llllllll NNNININN 

AACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATC 


957 
1777 


Qy 


958 


CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATGTGGAGC 

n ii , 1 1 , 1 1 1 1 1 1 ; i 1 1 1 1 1 1 1 : mmmm iiiiiiii iiiiiiii Him 

CACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGC 


1017 


Db 


1778 


1837 



Qy 1018 CTAGACACTCAGTACAGCAkGGTC^ 1077 

II Mill llllllllllllllllllll llllllllllllll lllllllllll III 

Db 1838 CTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCG 1897 

Qy 1078 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1137 

I II ! 1 1 1 1 '1 1 1 1. 1 1 1 1 1 II M I II 1. 1 1 1 1 1 1 1 . 1 1 1 MM Til 1 1 1 1 1 M 1 1 1 II I 

Db 1898 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGC 1957 

Qy 113 8 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

1 1 J i 1 1 ! I 1 1 1 i ' [ 1 1 1 i i i 1 1 1 1 1 lllllll 1 1 1 1 1 1 llllllllllllllllll 

Db 1958 CG CAAG CCCAAGGTGG AG CAG CTGTCCAACATGAT CGTG CG CT CCTG CAAGTG CAG CTG A 2017 



RESULT 5 

US-09-948-002-27 

; Sequence 27, Application US/09948002 

; Publication No. US20030050265A1 

; GENERAL INFORMATION: 

; APPLICANT: Nicholas M. Dean 

; APPLICANT: Susan F. Murray 

; TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
; TITLE OF INVENTION: FACTOR BETA EXPRESSION 
; FILE REFERENCE: ISPH-0607 

/ CURRENT APPLICATION NUMBER: US/09/948,002 

; CURRENT FILING DATE: 2000-09-05 

; PRIOR APPLICATION NUMBER: 09/661,753 

; PRIOR FILING DATE: 2000-09-14 

; PRIOR APPLICATION NUMBER: 60/154,546 

; PRIOR FILING DATE: 1999-09-17 

; NUMBER OF SEQ ID NOS : 71 

; SEQ ID NO 27 

LENGTH: 1585 

TYPE : DNA 

ORGANISM: Rattus norvegicus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: ( 4 13 ) . . . ( 1 5 8 5 ) 
US-09-948-002-27 

Query Match 71.4%; Score 854.2; DB 11; Length 1585; 

Best Local Similarity 83.1%; Pred. No. 1.7e-228; 

Matches 995; Conservative 0; Mismatches 178; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

Ml 1 1 1 1 MIMMMIMM lllllll Mill II II I MUM IIIIM 

Db 413 ATGCCGCCCTCGGGGCTGCGGCTGCTGCCGCTTCTGCTCCCACTCCCGTGGCTTCTAGTG 4 72 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

llllllll M MM Mill Mill II 1 1 M I II M Ml MM 1 1 1 1 II 1 1 1 Ml 

Db 4 73 CTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGACATGGAGCTG 532 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

Mill I M II I II II II M Mill II llllllll llllllll II MM III 

Db 533 GTGAAACGGAAGCGCATCGAAGCCATCCGTGGCCAGATCCTGTCCAAACTAAGGCTCGCC 5 92 



Qy 



181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 



Db 


593 


Qy 


241 


Db 


653 


Qy 


301 


Db 


713 


Qy 


361 


Db 


773 


Qy 


421 


Db 


833 


Qy 


481 


Db 


893 


Qy 


541 


Db 


953 


Qy 


601 


Db 


1013 


Qy 


661 


Db 


1073 


Qy 


721 


Db 


1133 


Qy 


781 


Db 


1193 


Qy 


841 


Db 


1253 


Qy 


901 


Db 


1289 


Qy 


961 


Db 


1349 


Qy 


1021 



II IIIIIIIIIIIIMIII II Mill MINIUM Mill II II Ml I 

AGTCCCCCGAGCCAGGGGGAGGTACCGCCGGGCCCGCTGCCCGAGGCGGTGCTCGCTTTG 652 
TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

MMIMI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II M II I II II I II M I M 1 1 II III 

TACAACAGCACCCGCGAGCGGGTGGCAGGCGAGAGCGCTGACCCGGAGCCCGAGCCCGAG 712 
GCGGACTAGTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

1 1 1 1 1 1 1 r r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 r 1 1 r 1 1 1 n mm mm 

GCGGACTAGTACGCCAAAGAAGTCACCCGCGTGCTAATGGTGGACCGCAACAACGCAATC 772 
TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 42 0 

Mill III Ml I II I I Mill MMIMI I Mill Mill M I 

TATGACAAAACCAAAGACATCACACACAGTATATATATGTTCTTCAATACGTCAGACATT 832 
CGGGAAGCGGTGCCGGAAGCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 80 

MMIMI Mill Mill II II 1 1 M M MMIMMMIMM III II 

CGGGAAGCAGTGCCAGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCCTGCAGAGATTC 8 92 
AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

MM II IIIIIIII Mill M II 1 1 1 llllil I i Mill MINIUM 

AAGTCAACTGTGGAGCAACACGTAGAACTCTACCAGAAATATAGCAACAATTCCTGGCGT 952 
TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 6 00 

Mill I IIIIIUIH, MINI III I N MIM, MM INN III 

TACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTGACGTC 1012 
ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 660 

II IIIIIIII MMMMIMM II INI I III INI WWW 1 1 1 1 1 

ACTGGAGTTGTCCGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGCTTCAGT 1072 

GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 720 

II INI II I lllllllllllllll I I I I I I I I I I I I II III III I 

GCTCACTGCTCTTGTGACAGCAAAGATAATGTACTCCACGTGGAAATCAATGGGATCAGT 1132 



I II II llllllllll Illlll II I llllllill I I I I M I I M II I II 



NNIIIINI INN INN II lllllllllllllll III 



GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 9 00 

1 1 m 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 iiiiiNii 

GATACCAACTACTGCTTCAGCTCCAC AGAGAAGAAC 12 8 8 

TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 960 

INN N millll NNIIIINI 1 1 1 1 1 1 M 1 1 1 1 I NININNI II 

TG CTGTGTACGGCAG CTGTACATTGACTTTAGGAAGGAC CTGGGTTGGAAGTGGAT CCAC 134 8 



GACACTCAGTACAGGAAGGTCCTGGCTCTGTACM 1080 

inn 1 1 1 1 1 1 1 1 1 1 n 1 1 1 1 1 ii n iiiiiiii ii 1 1 ii ii ii i ii ii n 



Db 



14 0 9 GACACACAGTACAGCAAGGTC 14 68 



Qy 1081 GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 114 0 

I IMIMIIIIIMIIIIIII MIMIMIIIIIIIMM Mlltllllll III 

Db 14 69 TCACCGTGCTGCGTGCGGCAGGCTTTGGAGCCACTGCCCATCGTCTACTACGTGGGTCGC 1528 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

llllllllllllllllll I I I II I I I I I I I I I I II I I llllllllllllllllll 
Db 1529 AAGCCCAAGGTGGAGCAGTTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 1585 



RESULT 6 
US-09-948-002-1 

Sequence 1, Application US/09948002 
Publication No. US20030050265A1 
GENERAL INFORMATION: 
APPLICANT: Nicholas M. Dean 
APPLICANT: Susan F . Murray 

TITLE OF INVENTION: ANTISENSE MODULATION OF TRANSFORMING GROWTH 
TITLE OF INVENTION: FACTOR BETA EXPRESSION 
FILE REFERENCE: ISPH-0607 

CURRENT APPLICATION NUMBER: US/09/94 8,002 
CURRENT FILING DATE: 2000-09-05 
PRIOR APPLICATION NUMBER : 09/661,753 
PRIOR FILING DATE: 2000-09-14 
PRIOR APPLICATION NUMBER: 60/154,546 
PRIOR FILING DATE: 1999-09-17 
NUMBER OF SEQ ID NOS : 71 
SEQ ID NO 1 
LENGTH: 2 094 
TYPE: DNA 

ORGANISM: Mus musculus 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (868) . . . (2040) 
US-09-948-002-1 

Query Match 70.8%; Score 847.8; DB 11; Length 2094; 

Best Local Similarity 82.8%; Pred. No. l.le-226; 

Matches 991; Conservative 0; Mismatches 182; Indels 24; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

III MM II III Ml II Ml I Mill 1 1 lllll II II M Mill I Ml M 

Db 868 ATGCCGCCCTCGGGGCTGCGGCTACTGCCGCTTCTGCTCCCACTCCCGTGGCTTCTAGTG 927 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

Illlllll 1 1 MM Mill Mill 1 1 1 1 1 : , 1 1 1 < 1 1 1 i 1 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 

Db 92 8 CTGACGCCCGGGAGGCCAGCCGCGGGACTCTCCACCTGCAAGACCATCGACATGGAGCTG 987 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

lllll IMMMMMIM Mill 1 1 Illlllll Illlllll II MM III 

Db 988 GTGAAACGGAAGCGCATCGAAGCCATCCGTGGCCAGATCCTGTCCAAACTAAGGCTCGCC 104 7 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

M Mill MMMIMM II IMMMMMMMM Mill M II Ml I 

Db 1048 AGTCCCCCAAGCCAGGGGGAGGTACCGCCCGGCCCGCTGCCCGAGGCGGTGCTCGCTTTG 1107 



Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

MINIM MMMMMMM 1 1 1 1 M M 1 Ml M IMM Mill II 

TA CAACAGCACCCG CGACCGGGTGG CAGG CGAGAG CG C CGA CCCAGAG CCGGAG C CCGAA 


300 


Db 


1108 


1167 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


360 


Db 


1168 


IIIIIIIIIII II M 1 1 M 1 1 M 1 1 1 ! 1 1 1 1 1 II 1 1 1 1 1 1 1 II MM III 

GCGGACTACTATGCTAAAGAGGTCACCCGCGTGCTAATGGTGGACCGCAACAACGCCATC 


1227 


Qy 


361 


TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 

Mill III III Mill Mill MINIM 1 Mill INN II 1 

TATGAGAAAACCAAAGACATCTCACACAGTATATATATGTTCTTCAATACGTCAGACATT 


420 


Db 


1228 


1287 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

1 1 1 r 1 1 1 1 INN INN llllll II N 1 1 1 1 1 1 1 1 1 1 1 1 III III 1 

CGGGAAGCAGTGCCCGAACCCCCATTGCTGTCCCGTGCAGAGCTGCGCTTGCAGAGATTA 


480 


Db 


1288 


1347 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

II 1 II llllli II INN N IIIIIIIIIII INN I'll M, 

AAATCAAGTGTGGAGCAACATGTGGAACTCTACCAGAAATATAGCAACAATTCCTGGCGT 


540 


Db 


1348 


1407 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

INN 1 'IIIIIIIIIII llllll III 1 II IIIIIIIIIII INN III 

TACCTTGGTAACCGGCTGCTGACCCCCACTGATACGCCTGAGTGGCTGTCTTTTGACGTC 


600 


Db 


1408 


1467 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

II IIIINII NININNIN II INI 1 III INI INN INI 

ACTGGAGTTGTACGGCAGTGGCTGAACCAAGGAGACGGAATACAGGGCTTTCGATTCAGC 


660 


Db 


1468 


1527 


Qy 


661 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

II INI II 1 IIIIIIINIIIIINI lllllllllllllll llllll IN 

GCTCACTGCTCTTGTGACAGCAAAGATAACAAACTCCACGTGGAAATCAACGGGATCAGC 


720 


Db 


1528 


1587 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 


780 


Db 


1588 


1 II INN lllllll llllll II 1 1 M III 1 1 M 1 1 II 1 1 1 1 1 1 1 1 III 

CCCAAACGTCGGGGCGACCTGGGCACCATCCATGACATGAACCGGCCCTTCCTGCTCCTC 


1647 


Qy 


781 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

ININNIN Mill 1 M 1 1 1 1 II 1 1 1 ' II '1 1 II III IIIINII IIIINII 

ATGGCCACCCCCCTGGAAAGGGCCCAGCACCTGCACAGCTCACGGCACCGGAGAGCCCTG 


840 


Db 


1648 


1707 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAAC 

IIIIIIIIIII llllllllllllll Illllllll 
GATACCAACTATTGCTTCAGCTCCAC AGAGAAGAAC 


900 


Db 


1708 


1743 


Qy 


901 


TGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCAT 

INN ININNIN IIIIIIIIIII 1 1 1 1 1 !' Mi 1 1 1 INIMIMN II 

TGCTGTGTGCGGCAGCTGTACATTGACTTTAGGAAGGACCTGGGTTGGAAGTGGATCCAC 


960 


Db 


1744 


1803 




961 


II II II II II II II II 1 1 II 1 1 INN INN INN INN II IIIINII 

GAGCCCAAGGGCTACCATGCCAACTTCTGTCTGGGACCCTGCCCCTATATTTGGAGCCTG 




Db 


1804 


1863 


Qy 


1021 


GACACTCAGTAC^GCAAGGTCCTGGCTCTGTAO^ACCAGCACAACCCGGGCGCGTCGGCG 

INN IIIIIIINIIIIINI II II IIIINII llllllllllllll llllll 

GACACACAGTACAGCAAGGTCCTTGCCCTCTACA^ 


1080 


Db 


1864 


1923 


Qy 


1081 


GCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGC 


1140 



I IMIIIIIIIIIIIIIIIII lllllllllllllllllll lllllllllll III 

Db 1924 TCACCGTGCTGCGTGCCGCAGGCTTTGGAGCCACTGCCCATCGTCTACTACGTGGGTCGC 1983 

Qy 1141 AAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IIIIIIMIIIIIIIIII I II I MM Mill Mill II III MM I llllllll 

Db 1984 AAGCCCAAGGTGGAGCAGTTGTCCAACATGATTGTGCGCTCCTGCAAGTGCAGCTGA 2 04 0 



RESULT 7 

US-09-756-283A-19 

Sequence 19, Application US/09756283A 
Patent No. US20020151478A1 
GENERAL INFORMATION: 
APPLICANT: Cherna j ovsky , Yuti 
APPLICANT: Dreja, Hanna Stina 
APPLICANT: Adams, Gillian 
TITLE OF INVENTION: Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 

CURRENT APPLICATION NUMBER : US/ 09/756 , 283A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 19 
LENGTH: 13 76 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: LAP-mlFNbeta construct 
NAME/ KEY: CDS 
LOCATION: (1) . . (1368) 
US-09-756-283A-19 

Query Match 54.6%; Score 654; DB 10; Length 1376; 

Best Local Similarity 88.1%; Pred . No. 1.4e-172; 

Matches 724; Conservative 0; Mismatches 95; Indels 3; Gaps 1; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

Ml MM II 1 1 1 M II II II MMIMIIMM IMMMMIMM II Ml 

Db 1 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 60 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

IMIMIMMM lllllllll Mill llllllllllllll I I I I J I I I I I J I I I I 
Db 61 CTGACGCCTGGCCCGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 120 

Qy 121 GTG AAG CGGAAG CG CAT CGAGG CCATT CG CGG C CAGATT CTGT C CAAG CTTCGG CTTG CC 18 0 

IMIIIIIMIIIIIIMMIIIIII MIMIIMM IMIIIMIII Mill III 

Db 121 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 18 0 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

MMMMMMMMMM M 1 1 1 1 M 1 1 II M II II M Mill M II II M 

Db 181 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 24 0 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 300 

iiiiiiii ii ii i ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 mi 1 1 1 1 1 : 1 1 1 1 ; 1 1 ; in 

Db 241 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 300 



Qy 



3 01 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 



Db 



301 



II IMIIIIMMMIIIIIIIIIIMIIMIIIIMMIIMII I 1 1 1 1 IIIII 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 360 



TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

IIIII II IIIIII I I MUM IIIIIIII I IIIIIIII II MINI 

TATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 420 

CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT- - -GAGG 4 77 

II IIIIIIII II Mill II IMIIIII IIIIIIIIIIIIII Mill MM 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGG 48 0 



M II II II M 1 1 M II M II II I M II I II 



CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 5 97 

II IIMIIMIIIIIIIIIIMIII IIIII IIIII M MUM I II MINI 

CGATACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGAT 600 
GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 657 

IIIIIIIIIIIIIIIMIIIIIII Ml III III II II Mill I.I'M 

GTCAC CGGAGTTGTG CGG CAGTGGTTGAG CCGTGGAGGGGAAATTGAGGG CTTT CG CCTT 660 
AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 717 

II lllllll MM IIIIIIII IMIMIMM II Mill M MMMIM 

AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 720 



I I I IIIIIIII 1 1 ! 1 1 1 1 1 1 1 1 1 . 1 1 1 ! 1 1 1 1 1 1 M I ! ' 1 1 1 1 1 ! IIIIIIII 



CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGC 819 

i j i f f 1 1 1 r i f 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 iiiii mi 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGC 822 



RESULT 8 

US-09-756-283A-21 

Sequence 21, Application US/09756283A 
Patent No. US20020151478A1 
GENERAL INFORMATION: 
APPLICANT: Chemajovsky, Yuti 
APPLICANT: Dreja, Hanna Stina 
APPLICANT: Adams, Gillian 
TITLE OF INVENTION: Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 
CURRENT APPLICATION NUMBER: US/ 0 9/756 , 283A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 21 
LENGTH: 1352 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: ml FNbeta -LAP construct 
NAME/ KEY: CDS 



Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


478 


Db 


481 


Qy 


538 


Db 


541 


Qy 


598 


Db 


601 


Qy 


658 


Db 


661 


Qy 


718 


Db 


721 


Qy 


778 


Db 


781 



LOCATION: (1) . . (1344) 
US-09-756-283A-21 



Query Match 49.4%; Score 591.4; DB 10; Length 1352; 

Best Local Similarity 87.0%; Pred. No. 4.1e-155; 

Matches 662; Conservative 0; Mismatches 96; Indels 3; Gaps 1; 



Qy 

Db 


63 
582 


GACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGT 

1 II II 1 1 II MM III MIMIIMMIM 1 Nil 1 II 

GG CGGGAGGGGGCTCAG CGG C CGCACTAT CCACCTG CAAG ACT ATCG ACATGG AG CTGGT 


122 
641 


Qy 

Db 


123 
642 


GAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 Mil l 1 Mill II Mill Mill 

GAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAG 


182 
701 


Qy 
Db 


183 
702 


CCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTA 

1 Mill IIMM 1 MM MMMIII 1 Mill II II II II II 

CCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTA 


242 
761 


Qy 


243 


CAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGC 

MIMI MIMIIMMIM 1 Ml MM 1 1 1 1 1 1 1 1 1 II 1 1 M II Mill 

CAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGC 


302 


Db 


762 


821 


Qy 


303 


GGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 It 1 MM MIMI 

CGACTACTACGCC^GGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTA 


362 


Db 


822 


881 


Qy 


363 


TGATAAATTC^GGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCG 


422 


Db 


882 


III II MIMI 1 1 MIMI 1 1 1 1 1 1 1 1 1 MMMM II 1 1 1 1 1 1 1 1 

TGACAAGTTCAAGCAGAGTACACACAG 


941 


Qy 


423 


GGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCT- - -GAGGCT 

MMMM II Mill II MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MIMI 

AGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGAGGCT 


479 


Db 


942 


1001 


Qy 


480 


CAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCG 

MIMIIIIMIIIMIIIIIIIIIIIIM IIIIIIMIIIIMMI 1 1 1 1 1 1 1 1 1 1 

CAAGTTAAAAGTGGAGCAGOVCGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCG 


539 


Db 


1002 


1061 


Qy 


540 


CTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGT 

1 II 1 Ml 1 IMM Ml II 1 1 Ml Mill Mill II IIMM 1 M MMMM 

ATArrTrjxnrAArrnnrTnrTnnrAPrrAnpnArTrnprAnAnTGr;TTATrTTTTnATr;T 


599 


Db 


1062 


112 1 


Qy 


600 


CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 M 1 1 III III Ml II II Mill MMMM II 

CACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAG 


659 


Db 


1122 


1181 


Qy 


660 


TG CC CACTCTT CCTCTGA CAG CAAAGATAA CA CACTCCACGTGGAAATTAACGGGTT CAA 

MIMM MM MMMM 1 1 1 1 M 1 1 II 1 M Mill II MIMIIMI 

CGCCCACTGCTCCTGTGACAGCAGGGATAAC^ 


719 


Db 


1182 


1241 


Qy 


720 


TTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCT 


779 


Db 


1242 


1 1 MMMM IIIIIIMIIIIIIIIMM IMMMMIMM IMIIMI II 

TACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCT 


1301 


Qy 


780 


CATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCT 82 0 

IIMM 1 1 1 1 1 1 1 1 1 1 1 1 . MINI . 1 1 Mill MM 





Db 1302 CATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCT 1342 



RESULT 9 

US-09-911-904-167 

Sequence 167, Application US/09911904 
Publication No. US2003 0096234A1 
GENERAL INFORMATION : 
APPLICANT: Farr, Spencer B. 
APPLICANT: Pickett, Gavin G. 
APPLICANT: Neft, Robin Eileen 
APPLICANT: Dunn, II, Robert Thomas 
TITLE OF INVENTION: CANINE TOXICITY GENES 
FILE REFERENCE: 400742000200 
CURRENT APPLICATION NUMBER : US/09/911 , 904 
CURRENT FILING DATE: 2002-04-09 
PRIOR APPLICATION NUMBER: US 60/220,057 
PRIOR FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS : 3 86 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 167 
LENGTH: 48 9 
TYPE: DNA 

ORGANISM: Canis familiaris 
FEATURE : 

NAME / KEY : misc_f eature 
LOCATION: (1) . . . (489) 
OTHER INFORMATION: n = A, T, C or G 
US-09-911-904-167 

Query Match 28.6%; Score 342.4; DB 11; Length 4 89; 

Best Local Similarity 89.6%; Pred . No. l.le-85; 

Matches 389; Conservative 0; Mismatches 21; Indels 24; Gaps 1; 

Qy 764 GGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCC 823 

I I I, I I I I I I I I IK! I II 1 I I I I , I IIIIIIIIIIIIIIIMIIIIIIIIIIIMI 
Db 1 GACCCTTCCTGCTCCTCATGGCCACCCCACTGGAGAGGGCCCAGCACCTGCACAGCTCCC 60 

Qy 824 GGCACCGCCGAGCCCTGGATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACG 8 83 

I I I I Mill llllllll lllllllllllllllllllllll 
Db 61 GG CAG CG CCGGG C C CTGGACA C CAACT A CTG CTT CAG CT C CAC 103 

Qy 884 ACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGG 943 

llllllllllllllllll lllllllllllllllllllllll IIIII MM 

Db 104 GGAGAAGAACTGCTGCGTCCGGCAGCTCTACATTGACTTCCGCAAGGATCTGG 156 

Qy 94 4 GCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTC 1003 

1 1 1 1 1 1 1 1 1 II 1 1 IIIII llllllll Mill II II 1 1 ! M I II M 1 1 1 1 1 1 1 I 

Db 157 GCTGGAAGTGGATCCATGAGCCCAAGGGTTACCACGCTAACTTCTGCCTGGGGCCCTGCC 216 

Qy 1004 CCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTG 1063 

III. Ml IIIIMM Mill MIIMMIIMIIIMIM MMMMMMIMI 

Db 217 CCTACATTTGGAGCCTGGA(^CGCAGTA»G(^GGTCCTGGCCCTGTACAAC(^GCACA 27 6 

Qy 1064 ACC CGGG CG CGTCGG CGG CG C CGTG CTG CGTG CCG CAGG CG CTGGAG C CACTG C C CAT CG 1123 

II I i 1 1 1 1 M I II 1 1 N M I M 1 1 1 M 1 1 1 M II 1 1 ! M t M M 1 1 i M I M 1 1 1 1 M M 

Db 277 ACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCG 336 



Qy 1124 TGTACTACGTGGGCCGC^GCCC^GGTGGAGCAGCTGTCaWVCATGATCGTGCGTTCCT 118 3 

I'll I II II M I II I II ; M II . 1 1 1 II II I II Ml 1 1 lllllllllllill Mil 

Db 337 TGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCGAACATGATCGTGCGCTCCT 396 

Qy 1184 G CAAGTG CAG CTGA 1197 

lllllllllllill 
Db 3 97 G CAAGTG CAG CTGA 410 



RESULT 10 
US-09-813-271B-1 

Sequence 1, Application US/09813271B 
Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US200201 15834Alart is Patent Department 
STREET: 564 Morris Avenue 
CITY: Summit 
STATE: New Jersey 

COUNTRY: USA 
ZIP : 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 9/8 13 , 2 71B 
FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/ 02 7 1 9 
FILING DATE: 12-Jul-95 
APPLICATION NUMBER: EPO 94810439.3 
FILING DATE: 25-Jul-94 
ATTORNEY/AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 
REGISTRATION NUMBER: 22640 

REFERENCE/DOCKET NUMBER: 4 -2 003 9C/C1C1 /USN 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (908) 522-6940 
TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 339 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
HYPOTHETICAL: NO 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu.hTGF-betal (DSM 5656) 



FEATURE : 

NAME /KEY: CDS 
LOCATION: 1 . .336 

OTHER INFORMATION: /products "human TGF-betal" 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-813-271B-1 

Query Match 23.5%; Score 281.2; DB 10; Length 339; 

Best Local Similarity 94.2%; Pred. No. 1.3e-68; 

Matches 292; Conservative 0; Mismatches 18; Indels 0; Gaps 0; 
Qy 8 88 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 94 7 

II IIIIIIIMIIIIIIIIIIIIIIIIII Ml! I M I II 1 1 1 1 IIIMIII Mill 

Db 3 0 CACGGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTG 8 9 

Qy 94 8 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 1007 

II II || IIIIIIIIIMIIIMIIII IIIMIII IIIMIII Mill 

Db 9 0 GAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTA 14 9 

Qy 1008 CATCTGGAGCCTAGAC^CTCAGTAC^GCAAGGTCCTGGCTCTGTACAACCAGCAC^ACCC 1067 

III IIIMIII Mill I IIIIIMMI Ml Ml III IMIIMIIIIIII Mill 

Db 15 0 CATTTGGAGCCTGGAC^CGCAGTAC^GCXAGGTCCTGGCCCTGTACAACCAGCATAACCC 20 9 

Qy 1068 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 112 7 

MUM MM I M !M M M I MM II 1 1 I Ml I Mill II 1 1 1 II 1 1 1 1 1 1 1 1 1 

Db 210 GGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTA 2 69 



Qy 1128 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 118 7 

I ! 1 [ , I : I , I 1 1 ! 1 1 1 1 1 1 1 1 i ! 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I IIIMIII 

Db 270 CTACGTGGGCCGCAAGCCCAAGGTGGAGC^GCTGTCC^aVTGATCGTGCGCTCCTGCAA 32 9 

Qy 118 8 GTGCAGCTGA 1197 

IIIIIMMI 
Db 33 0 GTGCAGCTGA 33 9 



RESULT 11 
US-09-813-271B-7 

; Sequence 7, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 

biologically active protein 
NUMBER OF SEQUENCES: 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: No. US20020115834Alartis Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 



SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/8 13 , 271B 
; FILING DATE: 20-Mar-2001 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95 / 0271 9 

FILING DATE: 12 -Jul -95 

APPLICATION NUMBER: EPO 94810439.3 

FILING DATE: 25-Jul-94 
ATTORNEY /AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 22640 

REFERENCE/ DOCKET NUMBER: 4-2 0 03 9C/ C1C1/USN 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (908) 522-6940 

TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 7: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 336 base pairs 

TYPE: nucleic acid 

STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: other nucleic acid 

DESCRIPTION: /desc = "recombinant hybrid DNA of 
IMMEDIATE SOURCE: 

CLONE: E. coli LC13 7/pPLMu . TGF-beta 1 ( 44 /4 5 ) beta3 
FEATURE : 

NAME /KEY : mat_pept ide 

LOCATION: 1..132 

OTHER INFORMATION: /product= "N-terminal 44 amino 
acids of human TGF-betal" 
FEATURE : 

NAME/KEY: ma t_pept ide 
LOCATION: 133 . .336 

OTHER INFORMATION: /product= "C-terminal 68 amino 
acids of human TGF-beta3 M 
FEATURE : 

NAME /KEY: CDS 
LOCATION: 1. .336 

OTHER INFORMATION: /product= "hybrid TGF-beta named 
TGF-beta 1-3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-813-271B-7 

Query Match 16.7%; Score 199.8; DB 10; Length 336; 

Best Local Similarity 78.2%; Pred. No. 7e-46; 

Matches 240; Conservative 0; Mismatches 67; Indels 0; Gaps 0; 
Qy 8 88 CAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTG 94 7 



Db 




Qy 



94 8 GAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTA 10 07 



Db 




Qy 1008 CATCTGGAGCCTAGACACTCAGTAC^GCAAGGTCCTGGCTCTGTAC^CCAGCAaUVCCC 1067 



I II Ml MINI llllll III 1 1 1 1 MINIMI I Mill 

Db 150 CCTCCGCAGTGC^GACAC^CCC^C^GCACGGTGCTGGGACTGTAa^CACTCTGAACCC 209 

Qy 1068 GGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTA 1127 

I II II II MM 1 1 1 1 1 1 1 1 1 1 MM MMMM III Mill 1 1 1 1 

Db 210 TGAAGCATCTGCCTCGCCTTGCTGCGTGCCCCAGGACCTGGAGCCCCTGACCATCCTGTA 26 9 

Qy 112 8 CTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAA 118 7 

III II II II Mill MMMM IIMIIMI I III II II II 

Db 27 0 CTATGTTGGGAGGACCCCCAAAGTGGAGCAGCTCTCCAACATGGTGGTGAAGTCTTGTAA 32 9 

Qy 1188 GTGCAGC 1194 

II III 

Db 33 0 ATGTAGC 33 6 



RESULT 12 
US-09-906-158-3 

Sequence 3, Application US/09906158 
Publication No. US20030078217A1 
GENERAL INFORMATION: 
APPLICANT: Brett P. Monia 
APPLICANT: Susan M. Freier 

TITLE OF INVENTION: ANTI SENSE MODULATION OF TRANSFORMING GROWTH FACTOR-BETA 3 
EXPRESSION 

FILE REFERENCE: RTS-0257 

CURRENT APPLICATION NUMBER: US/ 09/ 9 06 , 158 
CURRENT FILING DATE: 2001-07-14 
NUMBER OF SEQ ID NOS : 168 
SEQ ID NO 3 
LENGTH: 2574 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE : 
NAME /KEY: CDS 
LOCATION: (254) . . . (1492) 
US-09-906-158-3 

Query Match 16.5%; Score 198; DB 11; Length 2574; 

Best Local Similarity 51.0%; Pred. No. 3.9e-45; 

Matches 631; Conservative 0; Mismatches 535; Indels 72; Gaps 4; 
Qy 26 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 8 5 

Ml III I I Ml II III MUM III I II 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 32 0 

Qy 8 6 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 14 5 

MMMM MM Ml I III I I I I III MM I I M MM 

Db 321 CTCTGTCCACTTGCACCACCTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 38 0 

Qy 146 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 2 05 

II I M MM M MUM Ml MMMM I II II 

Db 381 TTAGGGGA»GATCTTGAG(^GCTCAGGCT»C(^GCCCCCCTGAGCCAACGGTGATGA 44 0 

Qy 2 06 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 265 

II II I II I II MM MMMM MM llllll 

Db 4 41 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 



Qy 2 66 CCGGGGAAAGTGTCGAACCGGAGCCCG AGCCAGAGGCGGACTACT 310 

II II II I I I I I I I III I I I I I I I I 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 311 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 370 

I Mill MM I MM I I I MM 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGGGCTGGCGGAGCACAACGAACTGG 614 

Qy 371 TCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 43 0 

III I I II I III I llllll 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 431 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 4 90 

M III I I II I III I II III I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 4 91 TGGAGCAGCACGTGGAG CTATAC CAGAAATA CAG CAATG 52 9 

MM M I II M I II I II I 

Db 73 5 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 7 94 

Qy 53 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 58 9 

llllll III II Mill I I I I I IIIIIIIIII 

Db 795 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCCACACGGGGCACTGCCGAGTGGCTGT 854 

Qy 590 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 64 9 

: M 1 1 1 1 1 1 1 1 1 1 I Mill IMMMI I llllll I I III 

Db 855 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 

Qy 650 TTCGCCTCAGTGCCCACTCTTCCTCT GACA 67 9 

I Ml Ml II I I III 

Db 915 TAGAAATCAGCATTCACTGTCCATGTCACACCTTTCAGCCCAATGGAGATATCCTGGAAA 974 

Qy 68 0 GCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACC 73 9 

II I I I I I II I MM I I II II II 

Db 975 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTGGACAATGAGGATGACCATGGCC 1034 

Qy 74 0 TGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGA 7 9 9 

I I II Ml II Mill I I I I 

Db 1035 GTGGAGATCTGGGGCGCCTCAAGAAGCAGAAGGATCACCACAACCCTCATCTAATCCTCA 10 94 

Qy 8 00 GGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACTACTGCTTCA 85 9 

I Ml III I I III I III I I 

Db 10 95 TGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGCGGG 1154 

Qy 8 60 GCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCT 919 

I Ml I Ml Ml 1 11 M M I M MM I Ml 

Db 1155 CTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 92 0 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 97 9 

1 1 1 1 1 1 1 1 1 1 1 1 Ml 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 I IMMMI 1 1 1 M 1 1 1 1 III 

Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 12 74 

Qy 98 0 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 103 9 

Ml MM II M II II III II I II MM MM II 

Db 1275 CCAACTTCTGCTC^GGCCCTTGCCCATACCTCCGCAGTGC^GACACAACCC^C^GC^CGG 1334 



Qy 


1040 


TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 


1099 


Db 


1335 


1 MM lllllllll 1 Mill 1 II 1 1 M MM MMMMMI 1 

TGCTGGGACTGTAG^CACTCTGAACGCTGAAGGATCTGCCTCGCCTTGCTGCGTGCCLL 


i j y 4 


Qy 


1100 


AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGGCGCAAGCCCAAGGTGGAGCAGC 


1159 


Db 


1395 


Ml lllllll! Ill Mill lllllll II II II Mill IMIIIIIII 

AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 


1454 


Qy 


1160 


TGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 




Db 


1455 


1 lllllllll 1 III II II II II llilll 
TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGA 14 92 





RESULT 13 
US-10-028-158-20 

; Sequence 20, Application US/10028158 

; Publication No. US20020110833A1 

; GENERAL INFORMATION: 

; APPLICANT: Caniggia , Isabella 

; APPLICANT: Post, Martin 

; APPLICANT: Lye, Stephen 

; TITLE OF INVENTION: METHODS TO DIAGNOSE A REQUIRED REGULATION OF 

; TITLE OF INVENTION: TROPHOBLAST 

; FILE REFERENCE: 11757. 38USWO 

; CURRENT APPLICATION NUMBER: US/10/028,158 

; CURRENT FILING DATE: 2001-12-20 

; PRIOR APPLICATION NUMBER: US/09/38 0 , 662 

; PRIOR FILING DATE: 1999-12-21 

; PRIOR APPLICATION NUMBER: PCT/CA98 / 0 0 1 8 0 

; PRIOR FILING DATE: 1998-03-05 

; PRIOR APPLICATION NUMBER: US 60/039,919 

; PRIOR FILING DATE: 1997-03-07 

; NUMBER OF SEQ ID NOS : 24 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 20 

LENGTH: 2574 

TYPE: DNA 
; ORGANISM : Homo sapiens 

FEATURE : 

NAME /KEY: CDS 

LOCATION: ( 2 54 ) . . ( 14 92 ) 
US-10-028-158-20 

Query Match 16.5%; Score 198; DB 13; Length 2574; 

Best Local Similarity 51.0%; Pred. No. 3.9e-45; 

Matches 631; Conservative 0; Mismatches 535; Indels 72; Gaps 4; 
Qy 26 TGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTGCTGACGCCTGGCCGGCCGGCCGCCG 85 

Mi III I I Ml II III MUM Mil II 

Db 261 TGCACTTGCAAAGGGCTCTGGTGGTCCTGGCCCTGCTGAACTTTGCCACGGTCAGCCTCT 320 

Qy 86 GACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGCATCGAGGCCA 14 5 

IIIIIMI MM Ml I Ml I I I I Ml MM I I II MM 

Db 321 CTCTGTCCACTTGCACCACCTTGGACTTCGGCCACATCAAGAAGAAGAGGGTGGAAGCCA 380 

Qy 146 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 2 05 

II I II Mill II MIMI MM Mllllllll I II II 



Db 381 TTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCAACGGTGATGA 44 0 

Qy 206 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 2 65 

I I II III I II Mill MINIMI Mill MINI 

Db 441 CCCACG TCCCCTATCAGGTCCTGGCCCTTTACAACAGCACCCGGGAGCTGCTGG 4 94 

Qy 266 CCGGGGAAAGTGTCGAACCGGAGCCCG AG C CAG AGG CGG ACT ACT 310 

II II II MM I I I III MM MM 

Db 4 95 AGGAGATGCATGGGGAGAGGGAGGAAGGCTGCACCCAGGAAAACACCGAGTCGGAATACT 554 

Qy 311 ACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAAT 370 

I Mill MM I III I I I I I I 1 1 

Db 555 ATGCCAAAGAAATCCATAAATTCGACATGATCCAGGGGCTGGC^ 614 

Qy 371 TC^GGGC^CCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGG 430 

Ml I I I I MM I I I I I I I 

Db 615 CTGTCTGCCCTAAAGGAATTACCTCCAAGGTTTTCCGCTTCAATGTGTCCTCAGTGGAGA 674 

Qy 431 TGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAG 4 90 

II III I I I I I III I II III I 

Db 675 AAAATAGAACCAACCTATTCCGAGCAGAATTCCGGGTCTTGCGGGTGCCCAACCCCAGCT 734 

Qy 4 91 TGGAG CAG CA CGTGGAG CTATACCAGAAATACAGCAATG 52 9 

III I I I I II II I I I II II 

Db 73 5 CTAAGCGGAATGAGCAGAGGATCGAGCTCTTCCAGATCCTTCGGCCAGATGAGCACATTG 794 

Qy 53 0 ATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGT 58 9 

MMM III II MINI I I I I I MMMMM 

Db 795 CCAAACAGCGCTATATCGGTGGCAAGAATCTGCCC 854 

Qy 590 CCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTT 64 9 

1 1 1 1 ; 1 1 1 1 1 > 1 1 I MMM MINIM I MMM I I IN 

Db 8 55 CCTTTGATGTCACTGACACTGTGCGTGAGTGGCTGTTGAGAAGAGAGTCCAACTTAGGTC 914 

Qy 650 TTCGCCTCAGTGCCCACTCTTCCTCT GACA 67 9 

I INI I I I I I I I I III 

Db 915 TAGAAATC^GC^TTC^CTGTCC^TGTCA(^CCTTTCAGCCa^TGGAGATATCCTGGAAA 974 

Qy 68 0 GCAAAGATAACACACTCCA.CGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACC 73 9 

II I I I I I II I II II I I II II II 

Db 97 5 ACATTCACGAGGTGATGGAAATCAAATTCAAAGGCGTC 1034 

Qy 74 0 TGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGA 799 

I I II I II II I II II I I I I 

Db 103 5 GTGGAGATCTGGGGCGCCTC^GAAGC^GAAGGATCACCACAACCCTCATCTAATCCTCA 1094 

Qy 800 GGGCCCAGCACCTGC^C^GCTCCCGGC^CCGCCGAGCCCTGGATACauVCTACTGCTTCA 8 59 

I I II IN I I INI Mill 

Db 10 95 TGATGATTCCCCCACACCGGCTCGACAACCCGGGCCAGGGGGGTCAGAGGAAGAAGCGGG 1154 

Qy 860 GCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCT 919 

I III I I II INI NIIIINN INN I INI 

Db 1155 CTTTGGACACCAATTACTGCTTCCGCAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCT 1214 

Qy 920 ACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATG 97 9 

ININNINN Nil NININNINN I N II II 1 1 NIIIIMI Ml 

Db 1215 ACATTGACTTCCGACAGGATCTGGGCTGGAAGTGGGTCCATGAACCTAAGGGCTACTATG 12 74 



Qy 980 CCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGG 103 9 

1 1 1 1 MUM M M M M Ml M I M MMM MMM 1 1 

Db 127 5 CCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGTGCAGACACAACCCACAGCACGG 1334 

Qy 1040 TCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGC 1099 

I MM MMMMI I MMI I M M M MM MMMMMI I 

Db 1335 TGCTGGGACTGTACAACACTCTGAACCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCC 1394 

Qy 1100 AGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGC 1159 

Ml llllllll Ml Mill lllllll II II II Mill Uh MM 

Db 1395 AGGACCTGGAGCCCCTGACCATCCTGTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGC 14 54 

Qy 1160 TG T C CAACATG AT CGTG CGTT CCTG CAAGTG CAG CTGA 1197 

I lllllllll I Ml M M M M MMM 

Db 14 55 TCTCCAACATGGTGGTGAAGTCTTGTAAATGTAGCTGA 14 92 



RESULT 14 
US-09-957-458B-9 

; Sequence 9, Application US/09957458B 
; Publication No. US20030166271A1 
; GENERAL INFORMATION: 
; APPLICANT: Chen, Una 

TITLE OF INVENTION : Method for growing stem cells 
; FILE REFERENCE: P66567US0 

; CURRENT APPLICATION NUMBER: US/ 09/ 95 7 , 4 58B 

; CURRENT FILING DATE: 2001-09-21 

; PRIOR APPLICATION NUMBER: PCT/EP00/08247 

/ PRIOR FILING DATE: 2000-08-24 

; PRIOR APPLICATION NUMBER: EP 99116533 

; PRIOR FILING DATE: 1999-08-24 

; NUMBER OF SEQ ID NOS : 10 

SOFTWARE: Patent In version 3.2 
; SEQ ID NO 9 

LENGTH: 4382 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: Vector for transforming supporting cell with a foreign to 
express 

OTHER INFORMATION: a gene product of interest 
US-09-957-458B-9 

Query Match 16.4%; Score 196; DB 12; Length 4382; 

Best Local Similarity 51.8%; Pred. No. 1.6e-44; 

Matches 646; Conservative 0; Mismatches 530; Indels 72; Gaps 6; 
Qy 16 CTG CGG CTCTTG CCGCTG CTGCTG CCG CTG CTGTGG CTG CTAGTGCTGACG C CTGG CCGG 75 

MM M I III III llllll Ml MMM I I 

Db 4 55 CTGCAGCCCATGCACTTGCAAAGGGCTCTGGTAGTCCTGGCCCTGCTGAACTTGGCCACA 514 

Qy 76 CCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTGGTGAAGCGGAAGCGC 13 5 

I I llllllll MM M I III I I I I Ml I II I I 

Db 515 ATCAGCCTCTCTCTGTCCACTTGCACCACGTTGGACTTCGGCCACATCAAGAAGAAGAGG 574 



Qy 136 ATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAG 195 



I II MINI I II Mill II MINI MM 1 1 1 1 1 i 1 1 1 I 

Db 575 GTGGAAGCCATTAGGGGACAGATCTTGAGCAAGCTCAGGCTCACCAGCCCCCCTGAGCCA 634 

Qy 196 GGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGC 2 55 

II II I I II I II I II Mill MMIMIMI Mill 

Db 635 TCGGTGATGACCCACG TCCCCTATCAGGTCCTGGCACTTTACAACAGCAGCCGG 688 

Qy 256 GACCGGGTAGCCG GGGAAAGTGT CG AA CCGGAG C C CGAG C CAGAG 3 00 

M I I I I MM II I M I Ml 

Db 68 9 GAGTTGCTGGAAGAGATGCACGGGGAGAGGGAGGAAGGCTGCACTCAGGAGACCTCGGAG 74 8 

Qy 301 GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 360 

I II Mill Mill Ml II I Ml I I I II 

Db 74 9 TCTGAGT A CT ATG C CAAAG AG ATC CATAAATT CG A CATGATC CAGGGACTGG CGGAG CAC 8 08 

Qy 361 TATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 420 

MM I II II I I II II II III 

Db 8 09 AATGAACTGGCCGTCTGCCCCAAAGGAATTACCTCTAAGGTTTTTCGTTTCAATGTGTCC 868 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

III III MM I I I I III I II I 

Db 869 TCAGTGGAGAAAAATGGAACCAATCTGTTCCGGGCAGAGTTCCGGGTCTTGCGGGTGCCC 928 

Qy 481 A AGTTAAAAGTGG AG CAG CA CGTGG AG CTATACCAGAAA 519 

I II I I I MM II M III 

Db 929 AAC C CCAG CT C CAAG CG CA CAGAG CAGAG AATTGAG CTCTT C CAG ATACTT CGA CCGG AT 988 

Qy 52 0 TACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCG 57 9 

i m i mm; II II Ml M I II I I I I 

Db 98 9 GAGC^CATAGCCAAGCAGCGCTACATAGGTGGCAA 104 8 

Qy 58 0 GAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCT 63 9 

M MMMM 1 1 MINIM I MINI MINIM I MINI I 

Db 104 9 GAATGGCTGTCTTTCGATGTCACTGACACTGTGCGCGAGTGGCTGTTGAGGAGAGAGTCC 1108 
Qy 64 0 ATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGC AA 683 

I MM I MM Mill I I I I Ml I I 

Db 1109 AACTTGGGTCTGGAAATGAGGATCCACTGTCCATGTCACA 1168 

Qy 684 AGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGC 74 3 

II I I I I II I Mill M I 

Db 1169 ATACTGGAAAATGTTCATGAGGTGATGGAAATCAAATTCAAAGGAGTGGACAATGAAGAT 1228 
Qy 744 CACCATTCACGGCATGAACCGGCCCTTCCT GCTCCTCATGGCCACCCCG 7 92 

Mill II M I II I II III II I 

Db 1229 GACCATGGCCGTGGAGACCTGGGGCGTCTCAAGAAGGAAAAG 12 88 

Qy 7 93 CTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACTAC 8 52 

Ml I II I I I I II III M II II I I 

Db 128 9 CTGATCCTC^TGATGATCCCCCC^C^CCGACTGGAC^GCCC^GGCC^GGGCAGTCAGAGG 134 8 

Qy 8 53 TGCTTCAGCTCCACGGACTACAAGGATGACGACGACAA GGAGAAGAACTGCTGCGTG 909 

II II MM III I II III MM IMMIIMI 1 1 

Db 134 9 AAGAAGAGGGCCCTGGACACCAATTACTGCTTCCGCAACCTGGAGGAGAACTGCTGTGTA 14 08 

Qy 910 CGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAG 969 

II I II II I I ' I MM II I II MM I III I II Mill III 



Db 



14 09 CGCCCCCTTTATATTGACTTCCGGCAGGATCTAGGCTGGAAATGGGTCCACGAACCTAAG 1468 



Qy 97 0 GGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAG 102 9 

II Ml M II 1 1 1 MMM II M M II Ml M I III MMM 

Db 14 69 GGTTACTATGCCAACTTCTGCTCAGGCCCTTGCCCATACCTCCGCAGCGCAGACACAACC 152 8 

Qy 103 0 TACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGC 1089 

I MM Ml M I II MMM I MMI I lllll II MM III 

Db 152 9 CATAGCACGGTGCTTGGACTATACAACACCCTGAACCCAGAGGCGTCTGCCTCGCCATGC 158 8 

Qy 1090 TGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAG 114 9 

Mill II MM MMMM III lllll IMMII MMM I I MMM 

Db 158 9 TGCGTCCCCCAGGACCTGGAGCCCCTGACCATCTTGTACTATGTGGGCAGAACCCCCAAG 1648 

Qy 115 0 GTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

MMMMMMMMIMM I Ml II II IIIIIIIIMM 

Db 164 9 GTGGAGCAGCTGTCCAACATGGTGGTGAAGTCGTGTAAGTGCAGCTGA 1696 



RESULT 15 
US-09-813-271B-5 

; Sequence 5, Application US/09813271B 
; Patent No. US20020115834A1 
GENERAL INFORMATION: 
APPLICANT: 

(A) Nico Cerletti 
TITLE OF INVENTION: New process for the production of 
; biologically active protein 

NUMBER OF SEQUENCES : 13 
CORRESPONDENCE ADDRESS : 

ADDRESSEE: No. US20020115834Alart is Patent Department 

STREET: 564 Morris Avenue 

CITY: Summit 

STATE: New Jersey 

COUNTRY: USA 

ZIP: 07901 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS -DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/8 13 , 271B 

FILING DATE: 20-Mar-2001 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: PCT/EP95/ 02 71 9 

FILING DATE: 12 -Jul -95 

APPLICATION NUMBER: EPO 94810439-3 

FILING DATE: 25-Jul-94 
ATTORNEY /AGENT INFORMATION: 

NAME: Pfeiffer, Hesna J. . 

REGISTRATION NUMBER: 2264 0 

REFERENCE/DOCKET NUMBER : 4 -2003 9C/C1C1/USN 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (908) 522-6940 
TELEFAX: (908) 522-6955 
INFORMATION FOR SEQ ID NO: 5: 



SEQUENCE CHARACTERISTICS: 

LENGTH: 33 9 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : doubl e 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA to mRNA 
IMMEDIATE SOURCE: 

CLONE: E. coli LC137/pPLMu.hTGF-beta3 (DSM 5658) 
FEATURE : 

NAME/ KEY: CDS 
LOCATION: 1. .336 

OTHER INFORMATION: /product= "human TGF-beta3" 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-813-271B-5 

Query Match 16.1%; Score 193; DB 10; Length 339; 

Best Local Similarity 76.0%; Pred. No. 5.6e-44; 

Matches 238; Conservative 0; Mismatches 75; Indels 0; Gaps 0; 
Qy 88 5 CGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 944 

I II II 1 1 MINIUM Mill I IMMMMMMIMI MM Mill 

Db 27 CAACTTGGAGGAGAACTGCTGTGTGCGCCCCCTCTACATTGACTTCCGACAGGATCTGGG 86 

Qy 94 5 CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 10 04 

1 1 1 1 1 1 1 1 1 1 I MMMM MIMIIII II M M I MINI 1 1 II M II 

Db 87 CTGGAAGTGGGTCCATGAACCTAAGGGCTACTATGCCAACTTCTGCTCAGGCCCTTGCCC 146 

Qy 1005 CTACATCTGGAGCCTAGACACTCAGTAC^GC^GGTCCTGGCTCTGTACAACCAGC^C^ 1064 

III II I II MINI MUM III MM MMMM I M 

Db 14 7 ATACCTCCGCAGTGCAGACACAACCCACAGC^ 2 06 

Qy 1065 CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 1124 

Ml I II II II MM MIIMIMM MM MMMM III Mill I 

Db 2 07 CCCTGAAGCATCTGCCTCGCCTTGCTGCGTGCCCCAGGACCTGGAGCCCCTGACCATCCT 266 

Qy 112 5 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTG 1184 

llllll MM II Mill MIIMIMM MIMIIII I Ml M 1 1 

Db 267 GTACTATGTTGGGAGGACCCCCAAAGTGGAGCAGCTCTCCAACATGGTGGTGAAGTCTTG 326 

Qy 118 5 CAAGTG CAG CTGA 1197 

M M MINI 
Db 327 TAAATGTAGCTGA 339 



Search completed: October 28, 2003, 09:04:14 
Job time : 361.927 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



October 27, 2003, 10:47:27 ; Search time 2970.42 Seconds 

(without alignments) 
9794.056 Million cell updates/sec 



Title: US-10-017-372E-12 
Perfect score: 1197 

Sequence: 1 atggcgccttcggggctgcg gttcctgcaagtgcagctga 1197 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 22781392 seqs, 12152238056 residues 

Total number of hits satisfying chosen parameters: 45562784 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing : Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : EST:* 

1: em_estba:* 

2: em_esthum:* 

3: em_estin:* 

4: em_estmu:* 

5: em_estov:* 

6: em_estpl:* 

7: em_estro:* 

8: em_htc:* 

9: gb_estl:* 
10: gb_est2:* 
11: gb_htc:* 
12: gb_est3:* 
13: gb_est4:* 
14: gb_est5:* 
15: em_estfun:* 
16: em_estom:* 
17: em_gss_hum:* 
18: em_gss_inv:* 
19: ern_gss_pln: * 
20: em_gss_vrt:* 
21: em_gss_fun:* 
22: em_gss_mam:* 
23 : em_gss_mus : * 
24: em_gss_pro:* 
25: em_gss_rod:* 
26: em_gss_phg:* 
27: em_gss_vrl : * 
28: gb__gssl:* 
29: gb_gss2:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 
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BM562135 
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900 
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c 
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c 
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c 
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10 
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, 7 
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.0 
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BE260971 
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.6 


46. 


.9 
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BQ675698 


BQ675698 


AGENCOURT 




13 
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.2 


44 . 
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BI818841 


BI818841 
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521. 


.2 


43 . 
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12 


BQ054305 
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AGENCOURT 


c 


15 
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.2 


43. 


.5 
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9 
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AL530080 AL530080 


c 


16 
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43 . 


, 4 
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12 


BI084718 


BI084718 


602869722 




17 
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42. 


. 9 


1049 


12 


BQ053417 


BQ053417 


AGENCOURT 




18 


513 


.6 


42. 


.9 
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13 


BQ952138 


BQ952138 


AGENCOURT 




19 


508 . 


.4 


42 . 


,5 
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12 


BI195242 


BI195242 


602944518 




20 
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.4 


42. 


,3 
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12 
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AGENCOURT 




21 
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.6 


42. 


.2 
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22 
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41, 
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.7 
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24 
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.3 
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28 
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.6 
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.6 
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29 
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.8 
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.5 
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.2 
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.8 
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13 
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31 
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.6 


38, 


,5 
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BI686959 
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38. 


.4 
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453. 


.4 


37. 


.9 


633 


14 


CB577186 


CB577186 


AMGNNUC : N 


c 
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.4 
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.2 
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13 


BX097979 
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c 


40 
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37, 


,2 
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13 


BX412434 


BX412434 


BX412434 




41 


442 . 


. 4 


37, 


,0 
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10 


BE615330 


BE615330 


601280728 


c 


42 
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6 


36. 


, 7 
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13 
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UI-H-FE1- 


c 


43 


439. 


.4 


36. 


. 7 


1015 


10 


BG683840 


BG683840 


602651694 


c 


44 


437 . 


. 6 


36. 


.6 
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14 


CA425775 
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UI-H-FE1- 


c 


45 


436. 


. 6 


36. 


. 5 


700 


13 


BU633922 


BU633922 


UI-H-FL1- 



ALIGNMENTS 



RESULT 1 
BM562135 

LOCUS BM562135 1072 bp mRNA linear EST 20-FEB-2002 

DEFINITION AGENCOURT_6562 032 NIH_MGC_118 Homo sapiens cDNA clone IMAGE : 5745463 

5 ' , mRNA sequence. 
ACCESSION BM562135 



VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BM562135 .1 GI : 18807966 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1072) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs -r@mail .nih.gov 

Tissue Procurement: Life Technologies, Inc. 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http : //image . llnl .gov 

Plate: LLAM12768 row: h column: 08 

High quality sequence start: 9 

High quality sequence stop: 64 7. 
Location/Qualifiers 
1. .1072 

/organism="Homo sapiens" 
/mol_type= " mRNA 11 
/db_xref ="taxon: 9606" 
/ clone=" IMAGE : 5745463" 
/tissue_type=" leukocyte" 
/lab_host="DH10B" 
/clone_lib="NIH_MGC_118" 

/note="Vector: pCMV-SP0RT6 ; Site_l: Not I; Site_2: EcoRV 
(destroyed) ; RNA source leukocytes from anonymous pool of 
non-activated adult donors. Library is oligo-dT primed 
and directionally cloned (EcoRV site is destroyed upon 
cloning). Average insert size 1.7 kb, insert size range 
1.2-3.3 kb. Library is normalized and enriched for 
full-length clones and was constructed by C. Gruber 
(Invitrogen) . Research Genetics tracking code 027. Note: 
this is a NIH_MGC Library." 
217 a 356 c 313 g 186 t 



Query Match 59.9%; Score 717.6; DB 12; Length 1072; 

Best Local Similarity 83.5%; Pred. No. l.le-151; 

Matches 885; Conservative 0; Mismatches 144; Indels 31; Gaps 



Qy 

Db 

Qy 
Db 



117 G CTGGTGAAG CGG AAG CG CATCGAGG C CATT CG CGG CCAG ATTCTGT C CAAG CTTCGG CT 176 

IIIIIIIIIIIIIIIIIIIIII MIIIII IIMIMIMI IIIIIIIIIII Mill 

20 GCTGGTGAAGCGGAAGCGCATCTAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCT 79 
177 TGCCAGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGC 236 

1 1 1 1 1 1 M 1 1 1! 1 1 1 1 1 1 1 1 1 1 1 I II 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 MIM II II II 

8 0 CGCCAGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGC 139 



Qy 



2 37 TCTTTACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCC 296 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J I MIMIM MM IMIIIIMMIMMI 



Db 



14 0 CCTGTACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCC 199 



Qy 


297 


Db 


200 


Qy 


357 


Db 


260 


Qy 


417 


Db 


320 


Qy 


477 


Db 


380 


Qy 


537 


Db 


440 


Qy 


597 


Db 


500 


Qy 


657 


Db 


560 


Qy 


717 


Db 


620 


Qy 


777 


Db 


680 


Qy 


836 


Db 


740 


Qy 


896 


Db 


776 


Qy 


956 


Db 


836 


Qy 


1016 


Db 


896 


Qy 


1076 


Db 


956 



297 AGAGGCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCA 3 56 

Mill IIIIMMIIIIIIMIMIIIIIIMIIilllllllllllM I MM I 

2 00 TGAGGCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGA 2 59 
AATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGA 416 

MINIMI II MINI I I MMM IIMMII I IIIIMM M II 

AATCTATGACAAGTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGA 319 
GCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAG 4 76 

llllll Mllllli II lllll II llllllll IMIMIMIMM llllllll 

GCTCCGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAG 3 7 9 



III 



GCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGA 5 96 

III M 1 1 1 II II II II II I II 1 1 1 1 1 lllll lllll II llllll I II Mill 

G CGATACCT CAG CAACCGGCTG CTGG CA CCCAG CGACT CG CCAGAGTGGTTATCTTTTGA 4 99 
TGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCT 656 

I M I M 1 1 M I M M 1 1 1 1 1 Ml < III III III II II lllll llllllll 

TGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCT 55 9 
CAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTT 716 

II MMM MM Mllllli MMIMMM II Mill M MMMM 

TAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTT 619 
CAATT CTGG CCGCCGGGGTGAC CTGG CCACCATTCACGGCATGAACCGGCCCTTCCTGCT 776 

II I I IIMMII MIMIMMMMMIMI 1 1 1 1 1 1 1 1 1 1 1 M I IIIIMM 

CACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCT 67 9 
CCTC^TGGCC^CCCCGCTGGAGAGGGCCC^GC^CCTGC^CAGCTCCCGGCACCGCCGAG - 83 5 

I f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 lllll MMMMMIMIIMM 

TCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGC 73 9 
CCCTGGATACC^ACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGA 8 95 

lllllll llllllll IMMMIMM I Mill 

CCCTGGACACCAACTATTGCTTCAGCTCCCC GGAGA 775 

AGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGA 95 5 

MMMMMMIMMIMM IMMMMIMM I llllll 1 1 1 1 1 1 1 1 ; 1 1 1 ! 

AGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAGGGACCTCGGCTGGAAGTGGA 83 5 
TTC^TGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGA 1015 

I II II Ml I I II III I Mill I IIMMII IIMMII MMMM MM 

TCCACGAGCCCCAAGGCTACCATGCCCACTTCTGCCTCGGGCCCTGCCCCTACATTTGGA 8 95 



MM Mill I Mlllll MMM lllllll II II MM II I 



Qy 1132 GTGGGCCGC--AAGCCCAAGGTGGAGCAGCTGTCCAACAT 1169 

I I I I III I MINI III II lillll I I I I 
Db 1016 CTGGGGCGCCAAGCCCCAAGTGGGAACACCTGTCCCACAT 1055 



RESULT 2 
BX349319 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BX349319 900 bp mRNA linear EST 05-MAY-2003 

BX349319 Homo sapiens B CELLS (RAMOS CELL LINE) COT 25 -NORMALIZED 
Homo sapiens cDNA clone CSODL010YL07 5-PRIME, mRNA sequence. 
BX349319 

BX349319.1 GI :30379410 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 900) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : / /www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CSOBAG059ZD04_CS05596_l&cluster=9160 . r . 
Contact : Feng Liang Email : fliang@lifetech.com URL : 
http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAG059ZD04_CS05596_1 . 

Locat ion/Qua 1 if iers 

1. .900 

/organism="Homo sapiens" 
/mol_type= "mRNA" 
/db_xref ="taxon: 9606" 
/clone= " CSODLO 1 0YL07 " 

/cell_type="B CELLS (RAMOS CELL LINE) COT 25 -NORMALIZED" 
/eel l_line=" RAMOS CELL LINE" 

/clone__lib="Homo sapiens B CELLS (RAMOS CELL LINE) COT 
2 5 -NORMALIZED" 

/note="lst strand cDNA was primed with a NotI -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
179 a 292 c 270 g 155 t 4 others 



Query Match 58.6%; Score 702; DB 13; Length 900; 

Best Local Similarity 88.0%; Pred. No. 3.5e-148; 

Matches 762; Conservative 0; Mismatches 104; Indels 0; Gaps 0; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 60 

Ml 1 1 M II MINIM IIIIMMIIIM 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 



Db 


35 


ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 


94 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 


120 


Db 


95 


IIIMIMIMIMIMIIIMI INN MIIIMMIMM M MINIMUM 

CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


154 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 


180 


Db 


155 


MMMIMIMMM1MMMMI MMMMMI 1 II 11 1 M 1 M MIM Ml 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


214 


Qy 


181 


AGCCCCGGGAGCCAGGGGGACGTGCGGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 


240 


Db 


215 


IMMMMMMMIMM M 1 II M II M M II II M 1 Mill 1 1 II II II 

AGCCCCCCGAGCCAGGGGGAGGTGGCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 


274 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 


300 


Db 


275 


MMMM MMMMMIMI IMIMII MM M 1 1 1 II 1 II II 1 1 1 1 1 III 

TA CAA CAG CA C CCG CGA C CGGGTGG CCGGGG AGAGTG CAG AAC CGGAG C C CGAG C CTGAG 


334 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 


360 


Db 


335 


II 1 M 1 1 1 1 1 1 1 II 1 ! II 1 1 II II 1 M M 1 ! II 1 1 1 M II 1 11 M 1 MM Mill 

GCCGACTACTAGGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


394 


Qy 


361 


TATGATAAATTCAAGGGCACCCGCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTC 


420 


Db 


395 


Mill II MUM 1 1 MINI MMMM 1 MMMM II MIMI 

TATGACAAGTTCAAGC^GAGTACAC^C^GCATATATATGTTCTTCAACACATCAGAGCTC 


454 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 


480 


Db 


455 


II MMMM M MMI M MMMM MMMMMIMI MIMMIIMI 

CGAGAAGGGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


514 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

M 1 1 M II 1 II 1 M 1 M 1 II II 1 II 1 M 1 M 1 II 1 II 1 1 1 1 II 1 1 1 IIMIIIIII 

AAGTTAAAAGTGGAGCAGCACGTGGAGGTGTAGCAGAAATACAGCAACAATTCCTGGCGA 


540 


Db 


515 


574 


Qy 


541 


TACCTCAGCAAGCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTC 

II Mill MIM II MIMI 1 M IIIMIMI 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTC 


600 


Db 


575 


634 


Qy 


601 


ACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGT 

MMMIMIMIIMIMM Ml III III II 1 1 Mill MMMM II 

ACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGC 


660 


Db 


635 


694 


Qy 
Db 


661 
695 


GCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAAT 

lllllll MM MMMM MMMMMI II Mill M 1 1 1 1 1 1 1 ! 1 1 1 

GCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACT 


720 
754 


Qy 


721 


TCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTC 

1 MMMM 1 MMMMMIMI II Ml III MM MM III II MMI Ml 

ALLOuLLbLLCjAbb I bALC I OGCCALCATTC^TGGCATGAACCGGCCTTTTCT 


780 


UD 


/ bb 


8 14 


Qy 
Db 


781 
815 


ATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTG 

IIIIIIIIIIIIIIIMIIMIIIIIIII Mill MIIIIIMIMII IMIMIM 

ATGGCCACCCGGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGNCGAGCCCTG 


840 
874 


Qy 


841 


GATACCAACTACTGCTTCAGCTCCAC 866 




Db 


875 


1 1 MM Mill MM III 

GNACACCACTATTGCTTNAGCTNCAC 90 0 





RESULT 3 

BX355682/c 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 

and Polayes,D. 



BASE COUNT 
ORIGIN 



BX355682 1201 bp mRNA linear EST 05-MAY-2003 

BX355682 Homo sapiens PLACENTA COT 25 -NORMALIZED Homo sapiens cDNA 
clone CS0DI002YJ05 3-PRIME, mRNA sequence. 
BX355682 

BX355682.1 GI:30371987 
EST . 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Euthena; Primates; 
1 (bases 1 to 1201) 
Li,W.B., Gruber,C, Jessee,J. 
Full-length cDNA libraries and normalization 
Unpubl ished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : / / www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq-CS0DI002CE03NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODI 002CE03NP1 . 

Location/Qualifiers 

1. .1201 

/organism= M Homo sapiens" 
/mol_type= "mRNA" 
/db_xref ="taxon: 9606" 
/clone=" CSODI 002 YJ05" 

/tissue_type=" PLACENTA COT 25 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 25 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
198 a 326 c 386 g 253 t 38 others 



Query Match 57.8%; Score 692.2; DB 13; Length 1201; 

Best Local Similarity 86.7%; Pred. No. 6.3e-146; 

Matches 824; Conservative 4; Mismatches 93; Indels 29; Gaps 5; 

Qy 2 50 ACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG - GCGGACTA 3 08 

I 'M MUM I - Illlllllllillllll III Ihllll 

Db 104 6 AACCGCGACCGGGTGGCCGGG - - AGAKKCAGAACCGGAGCCCGAGCCTGAGCCCGRACTA 98 9 

Qy 3 09 CTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAA 368 

IMMIMMMIIIMIIMIIMMIMMIIMII I 1 1 1 1 MM II 

Db 988 CTACGCCAAGGAGGTMACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGACAA 92 9 



Qy 


369 


ATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGC 

Mill 1 1 MINI llllllll 1 llllllll II llllllll Mill 

GTTCAAGCAGAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGC 


428 


Db 


928 


869 


Qy 


429 


GGTGCCGGAACCTGTATTGCTCT - CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAA 

III II Mill II lllllll 1 Illlllllllllll 1 M 1 1 M 1 1 II 1 1 1 II M 1 

GGTACCTGAACCCGTGTTGCTCTCCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAA 


487 


Db 


868 


809 


Qy 


488 


AAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCA 


547 


Db 


808 


I II 11 1 II II II M ! M ! II II II ! M M 1 M 1 II 1 1 II 1 M 1 1 1 II 1 1 lllllll 

AAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCA 


749 


Qy 


548 


GCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAG 

llllllllllllllll Mill Mill II llllll 1 II MIMMIMMIMI 

GCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAG 


607 


Db 


748 


689 


Qy 


608 


TTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACT 

Illlllllllllll III III III II II II II llllllll M lllllll 

TTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGA-GGCTTTCGCCTTAGCGCCCACT 


667 


Db 


688 


630 


Qy 


668 


CTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCC 


727 


Db 


629 


MM IMMMI IIMIIMMI M Mill M MMMMM 1 1 MM 

GCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCC 


570 


Qy 


728 


GCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCA 

MM MMMMMMMMMM IMMMMIMM IMMMI MMMMM 

GCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCA 


787 


Db 


569 


510 


Qy 


788 


CCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCA 

'ill; Mill MIMMMIIMIIIIIIIIIIIII MM 

CCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCGGGCACCGCCGAGCCCTGGACACCA 


847 


Db 


509 


450 


Qy 


848 


ACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGCG 

MM Illlllllllllll lllllll MINIMI 

ACTATTGCTTCAGCTCCAC GGAGAAGNACTGCTGCG 


907 


Db 


449 


414 


Qy 


908 


TGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCA 

MMMMM MMMMMMM IMMMI MMMMMMM II II MM 

TGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCA 


967 


Db 


413 


354 


Qy 


968 


AGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTC 

llllllllllllllll IMMMI llllllll llllllll llllllll Mill 1 

AGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGC 


1027 


Db 


353 


294 


Qy 


1028 


AGTACAGCAAGGTCCTGGCTCTGTAC^CCAGC^C^ACCCGGGCGCGTCGGCGGCGCCGT 

MMMIMMMIIMM Illlllllllllll IIMIIMMI llllllll II 

AbTAC^GCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTC 


1087 


Db 




234 


Qy 


1088 


GCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCA 

MIMIIIIIIIIIIIIIMIIIII II 1 1 1 1 1 1 1 1 1 II II 1 1 1 M 1 1 M 1 1 II 1 M 1 1 

GCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCA 


1147 


Db 


233 


174 


Qy 


1148 


AGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

MMMMIMMMMMIMMMIMM 1 ! 1 1 1 1 1 1 1 1 1 i 1 1 ' 1 1 1 

AGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 12 4 




Db 


173 





RESULT 4 

BX335351/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BASE COUNT 
ORIGIN 



BX335351 983 bp mRNA linear EST 01-MAY-2003 

BX33 5351 Homo sapiens PLACENTA COT 2 5 -NORMALIZED Homo sapiens cDNA 
clone CS0DI013YH16 3 -PRIME, mRNA sequence. 
BX335351 

BX335351. 1 GI :30308367 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 983) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 

http : //www. genoscope . ens . fr/cgi -bin/cluster . cgi?seq=CSODI 013DD08NP1 
&cluster=9160 . r . Contact : Feng Liang Email : fliang@lifetech.com 
URL : http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CSODI 013DD08NP1 . 

Location/Qualifiers 

1. .983 

/ organism="Homo sapiens" 
/mol_type= n mRNA" 
/db_xref ="taxon: 9606" 
/clone= "CSODI 013YH16" 

/tissue_type=" PLACENTA COT 2 5 -NORMALIZED" 
/clone_lib="Homo sapiens PLACENTA COT 2 5 -NORMALIZED" 
/note="lst strand cDNA was primed with a Not I -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
170 a 280 c 323 g 203 t 7 others 



Query Match 57 . 3%; 

Best Local Similarity 86.6%; 
Matches 8 05; Conservative 



Score 685.8; DB 13; Length 983; 
Pred. No. l.6e-144; 
5; Mismatches 93; Indels 27; 



Qy 



Db 



Gaps 



4; 



2 68 GGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACC 327 

Mill 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M Mill 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 

983 GGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCACC 924 



Qy 

Db 

Qy 

Db 



328 CGCGTGCTAATGGTGGAAAGCGGGAACCAAATCTATGATAAATTCAAGGGCACCCCCCAC 3 87 

I II ; I | MM llllllllll II llllll I I III 

923 CGCGTGCTAATGGTGGARACCCACAACGAAATCTATGACAAGTTCA^ 8 64 



3 88 AGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTG 

Ml llllllll I llllllll II IIIIIIM llllllll II Mill II Ml 

863 AGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTG 



447 



804 



Qy 


448 


Db 


803 


Qy 


508 


Db 


743 


Qy 


568 


Db 


683 


Qy 


628 


Db 


623 


Qy 


688 


Db 


564 


Qy 


748 


Db 


504 


Qy 


808 


Db 


444 


Qy 


868 


Db 


385 


Qy 


928 


Db 


348 


Qy 


988 


Db 


288 


Qy 


1048 


Db 


228 


Qy 


1108 


Db 


170 


Qy 


1168 


Db 


110 



CTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 507 

Mill llllllllllllll 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1! I II 1 1 ; I II 1 1 1 1 1 1 

CTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 744 

CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 567 

II I M I I I I I I I I I I I I I I I !: I I II I I [ IIIIIIIIIIMIIIIIIIIIII III 
CTGTACCAGAAATACAGCAACAATYCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 684 

AGTGAGTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 627 

II Mill M MMM I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III I 

AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 624 
CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGAT 687 

M Ml II II II II llllllll II lllllll llll MMMM Ml 

CGTGGAGGGGAAATTGA - GG CTTT CG C CTTAG CGCCCACTGCTC CTGTGA CAG CAGGGAT 56 5 
AACACACTGCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 74 7 

llllllll II Mill II IIIIIIIIII I I IIIIMII llllllllllllll! 

AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 505 
ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 8 07 

Mill IIIIIIIIIIIIM MMMM 1 1 M 1 1 M MM !l I II I M 1 1 M 1 1 1 1 1 

ATTCATGGCATGAACCGGCCTTTCGTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 44 5 
CACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACTACTGCTTCAGCTCCACG 867 

II Mill 1 1 II 1 1 1 Ml 1 1 II 1 1 1 II I'l II 1 1 MMMM llllllllllllll 

CATCTGCAAAGCTCCCGGCACCGCCGAGCGCTGGACACCAACTATTGCTTCAGCTCCAC- 38 6 

GACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGAC 927 

Illllllllllllllllllllllllll IMIMIII 
GGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGAC 34 9 

TTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTC 987 

Mill MMMM llllllllllllll II II 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 M I III 

TTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTC 28 9 
TGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCT 104 7 

Mill IIIIMII IIIIMII llllllll Mill MMM: MMIMIMM 

TGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTAMMMCAAGGTCCTGGCC 22 9 
CTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTG 1107 

Mllllllllllll Mill Mill 1 1 1 1 1 1 1 1 1 1 E f I I 1 1 1 1 1 1 1 1 1 1 1 M 

MTGTACAACCAGCATAACCC-GGCGCCTCGGCGGCGCCGTNGT-CGTGCCGCAGGCGCTG 171 
GAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 1167 

Mill MMMMMMMIMMIMM 1 1 1 1 1 1 1 III 1 1 1 II I M 1 1 1 1 1 1 1 1 II 

GAGCCGCTGCCCATCGTGTACTACGTGGGCTGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 111 
ATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

IMMIMMI 1 1 1 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 

ATGATCGTGCGCTCCTGCAAGTGCAGCTGA 81 



RESULT 5 

BX324511/C 

LOCUS BX324511 



1041 bp 



mRNA linear EST 02-MAY-2003 



DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL, 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BX324511 Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALI ZED Homo sapiens 

cDNA clone CS0DC024YD2 0 3 -PRIME, mRNA sequence. 

BX324511 

BX324511.1 GI :30332381 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 1041) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http: //www. genoscope. ens . fr/ 

cgi-bin/cluster . cgi?seq=CS0AC024DB10NP2&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0AC024DB10NP2 . 

Loca t i on/ Qua 1 i f i ers 

1. .1041 

/ organism= " Homo sapiens " 
/mol_type= M mRNA" 
/db_xref ="taxon: 9606" 
/clone="CS0DC024YD20" 

/tissue_type= "NEUROBLASTOMA COT 2 5 -NORMALI ZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 2 5 -NORMALI ZED" 
/note="lst strand cDNA was primed with a NotI -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
176 a 283 c 346 g 219 t 17 others 



Query Match 52.6%; Score 629.6; DB 13; 

Best Local Similarity 84.7%; Pred. No. 8.3e-132; 
Matches 788; Conservative 8; Mismatches 102; Indels 



Length 1041; 

32; Gaps 



7; 



Qy 269 GGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCACCC 32 8 

Ml I H IIIIIIIIIIIMI II :|: h II I h lllllll 

Db 1022 SGGGAGAKGSAGAACCGGAGCCCGACCCTRAS CCR ACTA YACCCCAAGRAGTCACCC 966 



Qy 

Db 

Qy 
Db 



32 9 GCGTGCTAATGGTGGAAAGCGGC^CCAAATCTATGATAAATTCAAGGGCACCCCCCACA 

MM hi III hi MM I MM llllllll II II MUM I Mill 

965 GCGTG YTAATGKT - GAAACCCACAACGAAATCTATGACAAGTTCAAGCAGAGTACACACA 



388 



907 



389 GCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGC 44 8 

II llllllll I llllllll II llllllll llllllll II Mill II MM 

906 GCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGTTGC 847 



Qy 



44 9 TCTCTC - GGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAG 5 07 



Db 



846 



Qy 


508 


Db 


786 


Qy 


568 


Db 


726 


Qy 


628 


Db 


666 


Qy 


688 


Db 


607 


Qy 


748 


Db 


547 


Qy 


808 


Db 


487 


Qy 


868 


Db 


428 


Qy 


928 


Db 


391 


Qy 


988 


Db 


331 


Qy 


1048 


Db 


271 


Qy 


1108 


Db 


213 


Qy 


1168 


Db 


153 



CTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCC 567 

II ! 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 lllllllll 1 1 1 1 II Ml 1 1 1 1 1 1 1 1 1 M 1 1 III 

CTGTACCAGAAATACAGCAACAATTCCTGGCGATACCTCAGCAACCGGCTGCTGGCACCC 727 
AGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACC 627 

II Mill II Mill I II i IM M MM 1 1 1 1 M Ml 1 1 1 1 1 M 1 1 M III I 

AGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGC 667 
CGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGAT 687 

II Ml M II II II Illlllll II lllllli MM Illlllll III 

CGTGGAGGGGAAATTGA - GG CTTTCG C CTTAG CG CC CACTGCTCCTGTGA CAGCAGGGAT 608 
AACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACC 74 7 

IIMIMI II Mill II M; IM I I Illlllll i I I ! 1 1 I I 1 1 1 1 1 1 

AACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCACC 548 
ATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAG 8 07 

Mill 1 1 ' I 1 1 1 1 ! ' 1 1 1 IIMIMI MIMMMMMMMIMMMIIMM 

ATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAG 4 88 
CACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACTACTGCTTCAGCTCCACG 867 

II Mill 1 1 1 1 1 1 1 MM! 1 1 i 1 1 1 IIMIMI MMIMMIMM 

CATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGACACCAACTATTGCTTCAGCTCCAC- 42 9 
GACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGAC 927 

I M 1 1 1 M M M II II 1 1 1 1! 1 1 M lllllllll 

GGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGAC 3 92 

TTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTC 987 

Mill Illlllll IMIIIIIMMM II II llllllllllllllllllll III 

TTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTC 332 

TGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCT 1047 

Mill Illlllll Illlllll Illlllll Mill llllllllllllllllllll 
TGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCC 272 

CTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTG 1107 

I I I M 1 1 1 M I M I II I II 1 1 1 1 1 llllllllllllll II Ml MM Ml III Ml 

CTGTACAACCAGCATAACCC-GGCGCCTCGGCGGCGCCGTG-TGCGTGCCGCAGGCGCTG 2 14 
GAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 1167 

II II I M IMM I MUM I IIIIMI II II IM MIMMMMM II MIMI IM 

GAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCAAC 154 
ATGATCGTG CGTT C CTG CAAGTG CAG CTGA 1197 

IIMIIMMI 1 1 1 M 1 1 III 1 1 Ml M I 

ATG AT CGTG CG CT C CTG CAAGTG CAG CTGA 124 



RESULT 6 
BM555996 
LOCUS 

DEFINITION 



BM555996 1043 bp mRNA linear EST 20-FEB-2002 

AGENCOURT_654443 7 NIH_MGC_88 Homo sapiens cDNA clone IMAGE : 5550039 
5 ' , mRNA sequence . 



ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



BM555996 

BM555996. 1 GI : 18796907 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1043) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Ema i 1 : cgapbs - r@ma i 1 . n ih . gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Life Technologies, Inc. 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

http: //image. llnl .gov 

Plate: LLAM12261 row: i column: 16 
High quality sequence stop: 702. 

Location/Qualifiers 

1. .1043 

/organism= n Homo sapiens" 
/ mo 1 _ t yp e = " mRNA " 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 5 550 039" 

/tissue_type= "duodenal adenocarcinoma, cell line" 
/lab_host="DH10B (phage-resistant) " 
/ c 1 one_l ib= " NI H_MGC_8 8 " 

/note="0rgan: small intestine; Vector: pCMV-SP0RT6; 
Site_l: Not I; Site_2 : Sail; Cloned unidirect ionally ; 
oligo-dT primed. Average insert size 1.767 kb. Library 
enriched for full-length clones and constructed by Life 
Technologies. Note: this is a NIH_MGC Library." 
202 a 329 c 345 g 154 t 13 others 



Query Match 52 . 6%; 

Best Local Similarity 88.1%; 
Matches 706; Conservative 



Score 629.4; DB 12; Length 1043; 
Pred. No. 9.2e-132; 
0; Mismatches 71; Indels 24; Gaps 



1; 



Qy 



Db 



386 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 44 5 

Mill MINIM I MINIM II MINIM MINIM II INN II I 

1 ACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 60 



Qy 

Db 

Qy 
Db 

Qy 



44 6 TGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 5 05 

MM I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II 1 1 M I Ml I II MM I i 1 1 III II II I 

61 TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 12 0 
5 06 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 565 

MM MMNNNNINN M 1 MM M 1 1 II III I M 1 1 II 1 1 Ml 1 1 I 

121 AGCTGTACC^GAAATAC^GCAACAATTCCTGGCGATACCTC^GCAACCGGCTGCTGGCAC 18 0 
566 CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 62 5 

MM INN II MINI I II 1 1 II 1 1 1 1 1 li 1 1 II I II 1 1 1 1 1 M II II 111 



Db 



181 CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 24 0 



Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



626 



241 



686 



CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAG 

III III II II Mill MM. Ill II lllllll 1 1 1 1 MINIM I 

GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 



685 



300 



745 



ATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGCGGCCGGGGTGACCTGGCCA 

1 1 1 1 1 1 1 1 1 1 II Mill M 1 1 1 1 1 1 1 1 1 1 I I MINIM IMIIIIIIIIII 

3 01 ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 360 



74 6 CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 

lllllll I I I I I I I t I I I I I I MINI I N I I I I I I I I I II I II I I I N I I II I 

3 61 CCATTCATGGCATGAACCGGCCTTTCGTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 

8 06 AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAACTACTGCTTCAGCTCCA 

MM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllll 1 IMIIIIIIIIII 

4 21 AGCATCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGACACCAACTATTGCTTCAGCTCCA 



805 



420 



865 



480 



925 



8 66 CGGACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTG 

I INIINNN NNINMNI II lllllll 

481 C GGAGAAGAACTG CTG CGTG CGG CAG CTGTACATTG 516 

92 6 ACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATT 98 5 

lllllll lllllll MMIMMIMM II II II II II II I II 1 1 1 1 1 1 1 II I 

517 ACTTCCGCAAGGACCTCGGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACT 576 



986 



577 



TCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGG 104 5 
TCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGG 636 



104 6 CTCTGTACAACCAGCACAAGCCGGGCGCGTCGG CGG CGC CGTG CTGCGTGCCGCAGGCGC 1105 

I IMIIIIIIIIIM NIN II I : 

63 7 CCCTGTACAACCAGCATAACCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGC 6 96 
1106 TGGAGCCACTGCCC^TCGTGTACTACGTGGGCCGC^GCCCAAGGTGGAGCAGCTGTCCA 1165 

lllllll II IN I N 1 1 II I N I NM 1 1 II I II I II I II II I III II 1 1 1 N I I M I 

697 TGGAGCCGCTGCCCATCGTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGCAGCTGTCCA 756 
1166 ACATGATCGTGCGTTCCTGCA 118 6 

IMIIIIIIIIII lllllll 

757 ACATGATCGTGCGCTCCTGCA 777 



RESULT 7 

BX383773/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



BX383773 950 bp mRNA linear EST 08-MAY-2003 

BX383773 Homo sapiens HELA CELLS COT 25 -NORMALIZED Homo sapiens 
cDNA clone CSODK001YA15 3 -PRIME, mRNA sequence. 
BX383773 

BX383773 . 1 GI :30457168 
EST. 

Homo sapiens (human) 
Homo sapiens 
Eukaryota ; Metazoa ; 
Mammal ia ; Eutheria ; 
1 (bases 1 to 950) 
Li , W. B . , Gruber, C. , 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Primates; Catarrhini; Hominidae; Homo. 



Jessee,J. and Polayes,D. 



TITLE 
JOURNAL 
COMMENT 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Full-length cDNA libraries and normalization 

Unpublished 

Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www. genoscope . ens . f r/ 

cgi -bin/cluster . cgi?seq=CS0DK00 lAA08NPl&cluster=9 160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DK001AA08NP1 . 

Location/Qual if iers 

1. .950 

/organism= " Homo sapiens " 
/mol_type="mRNA" 
/db_xref="taxon: 9606" 
/clone="CS0DK001YA15" 

/cell_type="HELA CELLS COT 2 5 -NORMALI ZED" 
/cell_line="HELA" 

/clone_lib="Homo sapiens HELA CELLS COT 25 -NORMALI ZED" 
/note=" 1st strand cDNA was primed with a Not I -ol igo (dT) 
primer. Five prime end enriched, double- strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
171 a 260 c 312 g 195 t 12 others 



Query Match 50.4%; Score 603.6; DB 13; Length 950; 

Best Local Similarity 83.9%; Pred. No. 6e-126; 

Matches 717; Conservative 8; Mismatches 104; Indels 26; 



Gaps 



3; 



Qy 

Db 



343 GAAAGCGGC^CCAAATCTATGATAAATTCAAGGGCACCCCCCACAGCTTATATATGCTG 4 02 

h i 1 1 1 1 Milium ii mill i : i mini iiiiiiii i 

943 GRAACCCACAACGAAATCTATGACAAGTTCAAGCAGAGMACA 8 84 



Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



4 03 TTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAG 4 62 

1 1 M 1 : 1 1 II IIIIIIII IIIIIIII II hill II IIIIIIII MINIMI 

883 TTC^AMA(^T(^GAGCTCCGAGAAGCGGTACCTGVACCCGTGTTGCTCTCCCGGGC^GAG 824 

4 63 CTG CG C CTG CTGAGG CTCAAG TTAAAAGTGG AG CAG CA CGTGG AG CTATA CCAGAAATA C 522 

Mill I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I 
823 CTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATAC 764 

523 AGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAG 582 

Mill II MIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill Mill II III 

763 AGCAAC^TAMCTGGCGATACCTC^GC^ACCGGCTGCTGGCACCCAGCGACTCGCCAGAG 7 04 

583 TGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATA 642 

III I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III III III M II 
703 TGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATT 644 



643 



702 



GAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTG 

II II IIIIIIII II MIMM MM I MIMI III MIMM II III 

64 3 GA-GGCTTTCGCCTTAGCGCCCACTGCTCCTGTTACAGCAGGGATTACACACTGCAAGTG 58 5 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 



703 



584 



763 



524 



823 



464 



883 



GAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAAC 762 

II II MINIM I I I MINI I 1 1 1 1 : 1 II I II 1 1 1 1 1 II I MM! 

GACATCAACGGGTTTACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAAC 525 



CGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCC 

Mill llllllll II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml llllll 

CGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCC 



822 



465 



CGGCACCGCCGAGCCCTGGATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGAC 882 
! I 1 I I I I I I I I I ^ I I I I I | I I I || I I I II I I I I I I I I I I I 

CGGCACCGCCGAGCMCTGGACACCAACTATTGCTTCAGCTCCAC 421 



GACGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTG 942 

MM M I M M MM I M M I M M I MIMMMMMM llllllll 

4 20 GGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTKACTTCCGCAAGGACCTC 36 9 



943 



368 



GGCTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGT 1002 

MINI 1 1 1 1 1 II II II M I II I M MM 1 1 1 1 1 II I MINIM MM, 

GGCTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCGACTTCTGCCTCGGGCCCTGC 309 



10 03 CCCTACATCTGGAGCCTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCAC 1062 

llllllll llllllll Mill I II M 1 1 1 1 1 1 1 1 II I II 1 1 I I I 

3 08 CCCTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCCTGGCGCCTGTACAACCAGCA 24 9 

1063 AACCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATC 1122 

I II I I II I II MMIIIMI MM I I I I I I III I M I II I II I llllllll 
24 8 TAACC CGG CGC CT CGG CGG CG C CGTG - TG CGTGCCGCAGG CG CTGGAGC CG CTG CC CATC 190 

1123 GTGTACTACGTGGGCCGCAAGCCCAAGGTGGAGC^GCTGTCCAACATGATCGTGCGTTCC 1182 

llllllll II II M 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 M II II I llllllll IM 

18 9 GTGTACTACGTGGGCKKCAAGCCCAAGGTGGAGCAGCTGTCCNACATGATCGTGCGCTCC 13 0 



1183 TGCAAGTGCAGCTGA 1197 

i I I 1 I I I I I 1 I I I I I 
12 9 TGCAAGTGCAGCTGA 115 



RESULT 8 

BX434425/C 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BX434425 888 bp mRNA linear EST 15-MAY-2003 

BX434425 Homo sapiens PLACENTA Homo sapiens cDNA clone CS0DE014YE16 
3 -PRIME, mRNA sequence. 
BX434425 

BX434425. 1 GI :30779291 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 888) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
Library was constructed by Life Technologies, a division of 
Invitrogen. This sequence belongs to sequence cluster 9160. r For 
more information about this cluster, see 
http : //www . genoscope . ens . f r/ 

cgi-bin/cluster . cgi?seq=CSOBAK028AB08NMl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0BAK028AB08NM1 . 

Location/Qualifiers 

1. .888 

/ organ ism=" Homo sapiens" 

/mol_type="mRNA" 

/db_xref = M taxon: 9606" 

/clone="CS0DE014YE16" 

/ 1 i s sue_type= " PLACENTA " 

/clone_lib="Homo sapiens PLACENTA" 

/note= "Vector : pCMVSP0RT_6 ; 1st strand cDNA was primed 
with a NotI -oligo (dT) primer. Five prime end enriched, 
double-strand cDNA was digested with Not I and cloned into 
the Not I and EcoRV sites of the pCMVSPORT 6 vector. 
Library was not normalized." 
156 a 253 c 302 g 176 t 1 others 



Query Match 4 9.2%; 

Best Local Similarity 87.1%; 
Matches 679; Conservative 



Score 58 9; DB 13; Length 888; 
Pred. No. 1.2e-122; 
0; Mismatches 76; Indels 25; 



Gaps 



2; 



Qy 
Db 



418 CTCCGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGG 4 77 

MM MMMI M MMI I Ml I MMMMMMM MMMI I 

888 CTCCAAAAAGCGGTACCTGAACCCTTGTTGTCTNCCCGGGCAGAGCTGCGTCTGCTGA-G 830 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



478 CTCAAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGG 537 

MMMM Ml MMMMMMMMMI MMMMMMMMI MMMM 

82 9 CTC^GTTTAAAATGGAGCAGC^CGTGGAGCTGTACCAGAAATAC^GCAAa^TTCCTGG 77 0 
538 CGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGAT 5 97 

M MMMMMMMMMMMI I Ml Mill II IMMI I II MMI 

769 CGATACCTCAGCAACCGGCTGCTGGCACACAGCGACTCGCCAGAGTGGTTATCTTTTGAT 710 
5 98 GTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTC 657 

I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml III III II II Mill MMMM 

709 GTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTT 650 
658 AGTGCCCACTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTC 717 

II MMMI MM MMMM 1 1 1 1 1 1 1 1 1 1 1 II Mill II MMMMI 

64 9 AGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTC 59 0 



718 



589 



778 



529 



AATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTC 

I I I MMMM 1 1 1 1 1 1 1 1 1 1 1 11 E 1 1 1 1 1 1 IMMMMMMI MMMM 

ACTACCGGCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTT 



777 



530 



837 



CTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCCGAGCC 4 7 0 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



838 



469 



898 



433 



CTGGATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAG 

iiiii mill imii iiiiiiiii mill 

CTGGACACCAACTATTGCTTCAGCTCCACG GAGAAG 



897 



434 



957 



AACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATT 

E [ 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 I Mill ! M 1 1 II : 1 1 ! 1 1 1 

AACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGGCTGGAAGTGGATC 374 



958 CATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGC 1017 

M M 1 1 M 1 1 1 1 1 1 1 1 1 1 : II I i : IIIIMM II IIIII 1 1 II II 1 1 IIMM 

3 73 CACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGC 314 

1018 CTAGACACTCAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCG 1077 

II IMII II II 1 1 M 1 1 1 1 1 II I M II ! IIIIIIIII I - 1 1 1 1 1 1 1 1 1 : 1 1 III 

313 CTGGACACGCAGTACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGGGCGCCTCG 254 

1078 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGC 1137 

1 1 1 1 1 1 1 1 II I II M 1 1 1 1 1 1 1 1 1 1 1 1 II MM M Mill MMMIMMMIMI 

2 53 GCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGTGTACTACGTGGGC 194 

113 8 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGTTCCTGCAAGTGCAGCTGA 1197 

II II II MM I M II II II II II II Ml II II I II II II I 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

193 CGCAAGCCCAAGGTGGAGCAGCTGTCCAACATGATCGTGCGCTCCTGCAAGTGCAGCTGA 134 



RESULT 9 
AL530081 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



Crania ta ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



FEATURES 

source 



AL530081 859 bp mRNA linear EST 23-MAY-2003 

AL530081 Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALIZED Homo sapiens 
cDNA clone CS0DD009YM06 5 - PRIME , mRNA sequence. 
AL530081 

AL530081.2 GI :31067916 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 859) 

Li,W.B., Gruber,C, Jessee,J. and Polayes,D. 
Full-length cDNA libraries and normalization 
Unpublished 

On Feb 13, 2001 this sequence version replaced gi: 12793574. 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 
was normalized. Library was constructed by Life Technologies, a 
division of Invitrogen. This sequence belongs to sequence cluster 
9160. r For more information about this cluster, see 
http : //www . genoscope . ens . f r/ 

cgi -bin/cluster . cgi?seq=CS0DD00 9BG03QPl&cluster=9 160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DD009BG03QP1 . 

Location/Qualifiers 

1. .859 

/organism="Homo sapiens" 



/ mo 1 _t ype = " mRNA " 
/db_xref = M taxon: 9606" 
/clone="CS0DD009YM06 M 

/tissue_t ype =*' NEUROBLASTOMA COT 50 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 50 -NORMALIZED" 
/note=" 1st strand cDNA was primed with a Not I -oligo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 

BASE COUNT 179 a 272 c 257 g 148 t 3 others 

ORIGIN 



Query Match 48.2%; Score 577.2; DB 9; Length 859; 

Best Local Similarity 87.3%; Pred. No. 5.3e-120; 

Matches 641; Conservative 2; Mismatches 90; Indels 1; Gaps 1; 



Qy 


l 


ATGG CG C CTT CGGGG CTG CGG CT CTTGC CG CTG CTG CTG CCG CTG CTGTGGCTG CTAGTG 

III MM M MMIMMII III IIIMMII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II III 

ATGCCGCCCTCCGGGCTGCGGCTGCTGCTGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 


60 


Db 


126 


185 


Qy 


61 


CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 Mill -III MMMMMIMM 

CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 


120 


Db 


186 


245 


Qy 


121 


GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 

1 1 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 Mill Mill Mill M Mill Ml 

GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 


180 


Db 


246 


305 


Qy 

Db 


181 

J U D 


AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 

[ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MM Ml M II M 1 MM Mill II II II II 


240 
7 £ ^ 

J D O 


Qy 


241 


TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 

III 1 III 1 1 Mill MM Ml 1 Ml II 1 M II 1 1 ! Ill 

TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 


300 


Db 


366 


425 


Qy 


301 


GCGGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATC 

II illllll III IIIMM lllllllll II MM MIMIIMM 1 Mil Mill 

GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 


360 


Db 


426 


485 


Qy 


361 


TATGATAAATTC^GGGC^CCCCCC^CAGCTTATATATGCTGTTC^ACACGTCGGAGCTC 

Mill II MUM 1 1 llllll 1 1 1 1 1 1 1 1 1 Illllll! II llllll 

TATGAC^GTTC^GC^GAGTACACACAGCATATATATGTTCTTCAACACATCAGAGCTC 


420 


Db 


486 


545 


Qy 


421 


CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 

II Illllll II Mill II IIIMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 


480 


Db 


546 


605 


Qy 


481 


AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 

MMMMMIMM Mill 1 1 II 1 III IM Mill M IMMIIMM 

AAGTTAAAAGTGGAGCAGCACGTGGAGCTGTACCAGAAATACAGCAACAATTCCTGGCGA 


540 


Db 


606 


665 


Qy 


541 


TACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTG-GCTGTCCTTTGATGT 

MMIMMII IMIMMIM Mill Mill II Mill 1 1 II MUM 

TACCTCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGTGTTATCTTTTGATGT 


599 


Db 


666 


725 



Qy 



600 CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAG 65 9 



Db 

Qy 

Db 

Qy 

Db 



MM II II III Ml III II II Mill IMMMI II 

72 6 CACCGGAGTTGTGCGGCAGTKGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAG 78 5 
660 TGCCCTVCTCTTCCTCTGACAGCAAAGATAAC^C^CTCCACGTGGAAATTAACGGGTTC^A 719 

Illllll MM IMMMI MMMMMI M .IN II MMMIMI 

78 6 CGCCCAGTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGRACATCAACGGGTTCAC 84 5 

72 0 TTCTGGCCGCCGGG 733 

I I llllllll I 
84 6 TACCGGCCGCCGAG 859 



RESULT 10 

BE312000 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BASE COUNT 



BE312000 713 bp mRNA linear EST 26-OCT-2000 

601154768F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE : 3510592 5', 
mRNA sequence. 
BE312000 

BE312000.1 GI:9130128 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hommidae; Homo. 

1 (bases 1 to 713) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM195 row: h column: 17 
High quality sequence start: 2 
High quality sequence stop: 713. 
Location/Qualifiers 
1. .713 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref="taxon: 9606" 

/clone= H IMAGE: 35105 92" 

/ 1 i s sue_t ype= " neurobl a s t oma " 

/lab_host="DH10B (phage-resistant ) " 

/clone_lib="NIH_MGC_19" 

/note="0rgan: brain; Vector: pOTB7; Site_l : Xhol ; Site_2 : 
EcoRI; cDNA made by oligo-dT priming. Direct ionally 
cloned into EcoRI/XhoI sites using the following 5' 
adaptor: GGCACGAG ( G ) . Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 
157 a 226 c 211 g 119 t 



ORIGIN 



Query Match 47.7%; Score 570.6; DB 10; Length 713; 

Best Local Similarity 87.5%; Pred. No. 1.6e-118; 

Matches 624; Conservative 0; Mismatches 89; Indels 0; Gaps 0; 

AG CGGAAG CG CAT CGAGG CCATT CG CGG C CAGATTCTGT CCAAG CTT CGG CTTG C CAG CC 184 

III llllllllllll MM MMMMMI MMMMMI MMI MMIM 

AGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCC 6 0 
CCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACA 244 

MMMMMMMM MMMMMMMMMM MMI M M M M MM 

CCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACA 12 0 
ACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGG 3 04 

MM MMMMMMM MMMM MM MMMMMMMMI MMI I 

ACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCG 18 0 
ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATG 364 

1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I MM 1 1 1 1 1 1 1 M 

ACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATG 24 0 
ATAAATT(^GGGCACCCCCC^C^GCTTATATATGCTGTT(^CACGTCGGAGCTCCGGG 424 

I 1 1 MUM I I llllll MMMM I MMMM II MMMM I 

ACAAGTTC^GCAGAGTACA.C^CAGCA^ 3 00 

AAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGT 4 84 
AAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGT 360 
TAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACC 544 

I J I 1 1 1 1 1 1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 I [ I ! 1 1 i 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 I M MM 

TAAAAGTGGAGCAGC^CGTGGAGCTGTACCAGAAATAC^GCAACAATTCCTGGCGATACC 42 0 
TC^GC^ACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCG 6 04 

I II 1 1 1 1 1 II 1 1 Ml M I MMI Mill II llllll I II lllllllllllll 

TCAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCG 4 8 0 
GAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCC 664 

MMIM MSIIII Ml III III M M MMI llllll II MM 

GAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCC 54 0 
ACTCTTCCTCTGACAGOW^GATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTG 724 

III MM MMMM MMMMMI II Mill II MINIM, I I I 

ACTGCTCCTGTGAC^G(^GGGATAA(^(^CTGC^GTGGA(^T(^CGGGTTCACTACCG 60 0 
GCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGG 784 

MMIM MMMMMMMMMM MIMIIIMMII MMMM MMIM 

GCCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGG 660 
CCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCC 837 

IIIIIMIIIIMIIIIIIIMIM Mill MMIM I I MUM 

CCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCCGGCACCGCGAGCC 713 



Qy 


125 


Db 


1 


Qy 


185 


Db 


61 


Qy 


245 


Db 


121 


Qy 


305 


Db 


181 


Qy 


365 


Db 


241 


Qy 


425 


Db 


301 


Qy 


485 


Db 


361 


Qy 


545 


Db 


421 


wy 


DUD 


Db 


481 


Qy 


665 


Db 


541 


Qy 


725 


Db 


601 


Qy 


785 


Db 


661 



RESULT 11 
BE260971 



LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BASE COUNT 
ORIGIN 



BE260971 717 bp mRNA linear EST 26-OCT-2000 

601153715F1 NIH_MGC_19 Homo sapiens cDNA clone IMAGE : 350993 1 5', 
mRNA sequence. 
BE260971 

BE260971 .1 GI : 9132709 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 717) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: image.llnl.gov 
Plate: LLCM193 row: m column: 04 
High quality sequence stop: 713. 
Location/Qualifiers 
1. .717 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone=" IMAGE: 3 509931" 
/ 1 i s sue_type= " neurobl a s t oma " 
/lab_host="DH10B (phage-resistant) " 
/ c 1 one_l i b= " N I H_MGC_1 9 " 

/note="0rgan: brain; Vector: pOTB7; Site_l: Xhol ; Site_2 : 
EcoRI; cDNA made by oligo-dT priming. Directionally 
cloned into EcoRI/XhoI sites using the following 5' 
adaptor: GGCACGAG (G) . Library constructed by Ling Hong 
in the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies) . 
Note: this is a NIH_MGC Library." 
158 a 226 c 213 g 120 t 



Query Match 47 . 0%; 

Best Local Similarity 87.4%; 
Matches 627; Conservative 



Score 562.6; DB 10; Length 717; 
Pred. No. le-116; 
0; Mismatches 89; Indels 1; 



Gaps 



1; 



Qy 

Db 



126 GCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCC 18 5 

Illllllllllllllllllll lllllllllll lllllllllll Mill llllllll 

1 GCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCC 60 



Qy 

Db 



186 



61 



CCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAA 

lllllllllll III llllllllllllllllllll Mill II II II II Mill 

CCCGAGCCAGG - GGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAA 



245 



119 



Qy 24 6 CAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGA 3 05 



Db 


120 


Qy 


306 


Db 


180 


Qy 


366 


Db 


240 


Qy 


426 


Db 


300 


Qy 


486 


Db 


360 


Qy 


546 


Db 


420 


Qy 


606 


Db 


480 


Qy 


666 


Db 


540 


Qy 


726 


Db 


600 


Qy 


786 


Db 


660 



Ml MINIMI III MMIMI MM MMMMIMMMM MMI 1 1 

CAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGA 179 
CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGA 365 

MMMMMMMMMMMMMMMMMMMMI I MM MMMMM 

CTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATCTATGA 23 9 
TAAATTCAAGGGCACCCCCCACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGA 42 5 

M MMM I I MMM MMMM I MMMM 1 1 MMMM 1 1 

CAAGTT CAAG CAGAGTACACACAGCATATATATGTT CTT CAACACATCAGAG CTC CGAGA 2 99 
AGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTT 4 8 5 

MMM M MMI II MMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I M 1 1 II 1 1 1 1 1 M II I 

AGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTT 35 9 
AAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCT 54 5 

1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMMM Mill 

AAAAGTGGAGCAGCACGTGGAGCTGTACC^GAAATACAGCAACAATTCCTGGCGATACCT 419 
CAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGG 6 05 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMI Mill II MMM I M IIIIIIMIIM.il 

CAGCAACCGGCTGCTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGG 4 79 
AGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCA 665 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml Ml II! M M Mill MMMM II MMI 

AGTTGTG CGG CAGTGGTTGAGCCGTGGAGGGGAAATTGAGGG CTTT CG C CTTAGCG CCCA 53 9 
CTCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGG 725 

M MM MMMM I E 1 1 1 1 1 1 1 i I II MMI II MMMMM I I M 

CTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGG 5 99 
CCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGC 785 

MMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MMMM MMMM 

CCGCCGAGGTGACCTGGCCACCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGC 659 
CACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGA 84 2 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill III III MM II III 

CACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCGGCACCGCGAGCCCTGGGA 716 



RESULT 12 

BQ675698 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



BQ675698 902 bp mRNA linear EST 15-JUL-2002 

AGENCOURT__8 036532 NIH_MGC_102 Homo sapiens cDNA clone IMAGE : 62 1 1917 
5', mRNA sequence. 
BQ675698 

BQ675698 . 1 GI :21786532 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 902) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 



FEATURES 

source 



BASE COUNT 
ORIGIN 



Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: ATCC 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consort ium/LLNL at: 

ht tp : / / image . 1 lnl . gov 

Plate: LLCM2375 row: c column: 22 

High quality sequence stop: 599. 
Location/Qual if lers 
1. .902 

/ organi sm= " Homo sap i ens " 
/mol_type= 11 mRNA " 
/db_xref ="taxon: 96 06" 
/clone-" IMAGE: 6211917" 

/tissue_type="epidermoid carcinoma, cell line" 
/lab_host="DH10B (phage-resistant) " 
/clone_lib="NIH_MGC_102" 

/note="0rgan: salivary gland; Vector: p0TB7; Site_l: Xhol ; 
Site_2: EcoRI; cDNA made by oligo-dT priming. 
Direct ionally cloned into EcoRI /Xhol sites using the 
following 5 1 adaptor: GGCACGAG ( G ) . Library constructed 
by Ling Hong in the laboratory of Gerald M. Rubin 
(University of California, Berkeley) using ZAP-cDNA 
synthesis kit (Stratagene) and Superscript II RT (Life 
Technologies). Note: this is a NIH_MGC Library." 
194 a 292 c 255 g 161 t 



Query Match 46.9%; 
Best Local Similarity 87.4%; 
Matches 615; Conservative 



Score 561.6; DB 13; 
Pred. No. 1.8e-116; 
0; Mismatches 89; 



Length 902; 



Indels 



0 ; Gaps 



Qy 
Db 



14 6 TTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCCAGCCCCCCGAGCCAGGGGGACGTGC 205 

I MINIUM MMMMMI MMI MMMMMMIMMMMM MM 

1 TCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCCAGCCCCCCGAGCCAGGGGGAGGTGC 60 



Qy 
Db 

Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 
Db 



2 06 CGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTTTACAACAGTACCCGCGACCGGGTAG 265 

MMMMMMMM MMI M M M M 1 1 1 1 M 1 1 MMMMMMM I 

61 CGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTGTACAACAGCACCCGCGACCGGGTGG 12 0 



266 



325 



CCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAGGCGGACTACTACGCCAAGGAGGTCA 

IMMM MM MIMMMMMMM MMI 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 

121 CCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAGGCCGACTACTACGCCAAGGAGGTCA 18 0 
3 26 CCCGCGTGCTAATGGTGGAAAGCGGCAACCAAATCTATGATAAATTCAAGGGCACCCCCC 38 5 

I M 1 1 II 1 1 1 II II 1 1 1 1 1 1 1 I MM 1 1 1 1 1 1 1 1 1 1 II MUM I I ! 

181 CCCGCGTGCTAATGGTGGAAACCC^C^CGAAATCTATGAC^GTTC^AGCAGAGTACAC 24 0 
3 86 ACAGCTTATATATGCTGTTCAACACGTCGGAGCTCCGGGAAGCGGTGCCGGAACCTGTAT 44 5 

MMI M M 1 1 1 1 I MM II MMMM MMMM II Mill II I 

241 ACAGCATATATATGTTCTTCAACACATCAGAGCTCCGAGAAGCGGTACCTGAACCCGTGT 300 



446 



505 



TGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 

IMMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 II 1 1 1 1 1 1 1 1 

3 01 TGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTCAAGTTAAAAGTGGAGCAGCACGTGG 360 



Qy 5 06 AGCTATACCAGAAATACAGCAATGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCC 565 

MM MMMMMMMMI MMMMM MMMMMMMMMMM! I 

Db 3 61 AG CTGTAC CAGAAATA CAG CAA CAATTCCTGG CG ATA C CT CAG CAA C CGG CTG CTGG CA C 42 0 

Qy 566 CCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGA 62 5 

MM MMI M MMM I M M M M M M M M M M M M M M M M Ml 

Db 421 CCAGCGACTCGCCAGAGTGGTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGA 48 0 

Qy 626 CCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGCAAAG 685 

Ml III II M Mill 1 1 1 1 ! 1 1 1 II MM MM 1 1 1 1 1 1 1 1 I 

Db 481 GCCGTGGAGGGGAAATTGAGGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGG 54 0 

Qy 686 ATAACACACTCGACGTGGAAATTAACGGGTTCAATTCTGGCCGCCGGGGTGACCTGGCCA 74 5 

MMMMM II Mill II MMMMM I I 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1 

Db 541 ATAACACACTGCAAGTGGACATCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGCCA 60 0 

Qy 74 6 CCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCC 8 05 

I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I 
Db 6 01 CCATTCATGGCATGAACCGGCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCC 660 

Qy 8 06 AGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACCAAC 84 9 

MM MM I I I I I I I I I I I I I I I I I 1 I I I I I I I I II 

Db 661 AGCATCTGCCAAGCTCCCGGCACCGCCGAGCCCTGGGACACCAC 704 



RESULT 13 

BI818841 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



bp mRNA linear EST 04-OCT-2 001 
sapiens cDNA clone IMAGE : 5178433 5', 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



BI818841 925 
603037307F1 NIH_MGC_115 Homo 
mRNA sequence. 
BI818841 

BI818841.1 GI:15930391 
EST. 

Homo s ap i ens ( human ) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 925) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs -r@mail .nih.gov 

Tissue Procurement: Life Technologies, Inc. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 

DNA Sequencing by: Incyte Genomics, Inc. 

Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consort ium/LLNL at: 
http: //image. llnl .gov 
Plate: LLAM11444 row: n column: 02 
High quality sequence stop: 874. 

Loca t i on/Qua 1 i f i er s 

1. .925 

/organism="Homo sapiens" 
/mol_type= M mRNA M 



/db_xref="taxon: 9606" 
/clone=" IMAGE: 5178433" 
/lab_host="DH10B" 
/clone_lib="NI H_MG C_ 1 1 5 " 

/note="Organ : pooled brain, lung, testis; Vector: 
pCMV-SP0RT6; Site_l : Not I; Site_2 : EcoRV (destroyed); RNA 
source anonymous pool of 6 male brains, age range 23-2 7; 1 
male lung, age 27; and 1 male testis, age 69. Library is 
oligo-dT primed and direct ionally cloned (EcoRV site is 
destroyed upon cloning) . Average insert size 1.8 kb, 
insert size range 1-3 kb. Library is normalized and 
enriched for full-length clones and was constructed by C. 
Gruber (Invitrogen) . Research Genetics tracking code 
021. Note: this is a NIH_MGC Library." 

BASE COUNT 170 a 316 c 278 g 161 t 

ORIGIN 

Query Match 44.6%; Score 534.2; DB 12; Length 925; 

Best Local Similarity 87.6%; Pred . No. 2.8e-110; 

Matches 607; Conservative 0; Mismatches 83; Indels 3; Gaps 2; 
Qy 1 ATGGCGCCTTCGGGGCTGCGGCTCTTGCCGCTGCTGCTGCCGCTGCTGTGGCTGCTAGTG 6 0 

III 1 1 1 1 II lllllllllll MINIMUM MMMMMMI II 1 1 1 

Db 151 ATGCCGCCCTCCGGGCTGCGGCTGCTGCCGCTGCTGCTACCGCTGCTGTGGCTACTGGTG 210 

Qy 61 CTGACGCCTGGCCGGCCGGCCGCCGGACTGTCCACCTGCAAGACCATCGACATGGAGCTG 12 0 

IMMMMMMMMMMM Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 211 CTGACGCCTGGCCGGCCGGCCGCGGGACTATCCACCTGCAAGACTATCGACATGGAGCTG 270 

Qy 121 GTGAAGCGGAAGCGCATCGAGGCCATTCGCGGCCAGATTCTGTCCAAGCTTCGGCTTGCC 18 0 

MIMIMI MIM IMIIMIM I III! I lllllllllll Mill III 

Db 271 GTGAAGCGGAAGCGCATCGAGGCCATCCGCGGCCAGATCCTGTCCAAGCTGCGGCTCGCC 33 0 

Qy 181 AGCCCCCCGAGCCAGGGGGACGTGCCGCCCGGCCCGCTGCCTGAGGCAGTACTGGCTCTT 24 0 

1 1 1 1 1 1 1 1 F 1 1 1 1 M I F 1 1 1 M IM II MINIM Mill II 1 1 II II 

Db 331 AGCCCCCCGAGCCAGGGGGAGGTGCCGCCCGGCCCGCTGCCCGAGGCCGTGCTCGCCCTG 3 90 

Qy 241 TACAACAGTACCCGCGACCGGGTAGCCGGGGAAAGTGTCGAACCGGAGCCCGAGCCAGAG 3 00 

MINIM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 MINIM MM II M 1 1 M M 1 1 1 1 1 1 1 III 

Db 3 91 TACAACAGCACCCGCGACCGGGTGGCCGGGGAGAGTGCAGAACCGGAGCCCGAGCCTGAG 450 

Qy 3 01 GCGGACTACTACGCC^GGAGGTC^CCCGCGTGCTAATGGTGGAAAGCGGC^Cau^TC 3 60 

II ' 1 1 I MINI 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I MM Mill 

Db 4 51 GCCGACTACTACGCCAAGGAGGTCACCCGCGTGCTAATGGTGGAAACCCACAACGAAATC 510 

Qy 361 TATGATAAATTCAAGGGCACCCCCCAC^GCTTATATATGCTGTTC^CACGTCGGAGCTC 42 0 

Mill II MINI I I INN INN I MINIM II MINI 

Db 511 TATGACAAGTTCAAGCAGAGTACAC^ 57 0 

Qy 421 CGGGAAGCGGTGCCGGAACCTGTATTGCTCTCTCGGGCAGAGCTGCGCCTGCTGAGGCTC 4 8 0 

II MINIM II Mill II llllllll 1 1 J [ 1 1 1 1 1 1 J 1 1 i MINIMUM 

Db 571 CGAGAAGCGGTACCTGAACCCGTGTTGCTCTCCCGGGCAGAGCTGCGTCTGCTGAGGCTC 63 0 

Qy 4 81 AAGTTAAAAGTGGAGCAGCACGTGGAGCTATACCAGAAATACAGCAATGATTCCTGGCGC 54 0 

I Ml 1 1 1 1 1 1 1 1 1 IN I II II 1 M II 1 1 1 N I N I II I II 1 1 1 1 1 1 IMIIMIM 

Db 631 AAGTTAAAAGTGGAG(^GCACGTGGAGCTGTAC(>GAAATAC^Ga^CAATTCCTGGCGA 690 



Qy 
Db 

Qy 
Db 

Qy 
Db 



541 TACCTCAGCAACCGGCTGC - TGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGT 599 

Illllllllllllllllll 1 1 1 1 Mill Mill II llllll I II Mllllll 

691 TACCTCAGCAACCGGCTGCTTGGCACCCAGCGACTCGCCAGAGTGGTTATCTTTTGATGT 75 0 
6 00 CACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGA - - GAGGCTATAGAGGGTTTTCGCCTC 657 

IMIMIMIMIIIMIIIII III III II I I II Mill Mllllll 

751 CACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGACGGGGAACATTGAGGGCTTTCGCCTT 810 



658 



690 



AGTGCCCACTCTTCCTCTGACAGCAAAGATAAC 

II llllll MM Mllllll II II I 

811 AGCGCCCACTGCTCCTGTGACAGCAGGGATACC 843 



RESULT 14 

BQ054305 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



FEATURES 

source 



BQ054305 1093 bp mRNA linear EST 29-MAR-2002 

AGENCOURT_68 3 04 09 NIH_MGC_106 Homo sapiens cDNA clone IMAGE : 5936397 
5', mRNA sequence. 
BQ054305 

BQ054305.1 GI:19813645 
EST. 

Homo sap l ens ( human ) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1093) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 
Unpublished 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r@mail .nih.gov 

Tissue Procurement: Dr. Daniel McVicar, DBS/NCI 
cDNA Library Preparation: Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: Agencourt Bioscience Corporation 
Clone distribution: MGC clone distribution information can be 
found through the I.M.A.G.E. Consortium/ LLNL at: 
http: //image. llnl.gov 
Plate: LLCM2125 row: k column: 22 
High quality sequence stop: 605. 
Location/Qualifiers 
1. .1093 

/ organi sm= " Homo sapiens 11 
/ mo 1 _t ype = " mRNA " 
/db_xref="taxon: 9606" 
/clone=" IMAGE: 5936397" 

/tissue_type="natural killer cells, cell line" 
/lab_host="DH10B (phage-resistant ) » 
/clone__lib="NIH_MGC_106" 

/not e»" Organ: blood; Vector: pOTB7; Site___l : Xhol ; Site_2 : 
EcoRI ; cDNA made by oligo-dT priming. Direct lonally cloned 
into EcoRI/XhoI sites using the following 5' adaptor: 
GGCACGAG (G) . Library constructed by Ling Hong in the 
laboratory of Gerald M. Rubin (University of California, 
Berkeley) using ZAP-cDNA synthesis kit (Stratagene) and 
Superscript II RT (Life Technologies) . Note: this is a 
NIH_MGC Library. " 



BASE COUNT 233 a 347 c 309 g 203 t 1 others 

ORIGIN 



Query Match 43.5%; Score 521.2; DB 12; Length 1093; 

Best Local Similarity 87.8%; Pred. No. 2.6e-107; 

Matches 590; Conservative 0; Mismatches 58; Indels 24; Gaps 1; 
Qy 52 5 CAA.TGATTCCTGGCGCTACCTCAGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTG 584 

III I 1:1 Mill Mill II Mill 

Db 1 (^CAATTCCTGGCGATACCT(^GC^CCGGCTGCTGGC^CC(^GCGACTCGCCAGAGTG 60 

Qy 58 5 GCTGTCCTTTGATGTCACCGGAGTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGA 644 

I I M I M 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml Ml III II II II 

Db 61 GTTATCTTTTGATGTCACCGGAGTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTGA 12 0 

Qy 64 5 GGGTTTTCGCCTCAGTGCCCACTCTTCCTCTGACAGOU^GATAAC^a^CTCCACGTGGA 7 04 

III Mill II III MM MM II I Ml II I Mill I MM II Mill 

Db 121 GGGCTTTCGCCTTAGCGCCCACTGCTCCTGTGACAGCAGGGATAACACACTGCAAGTGGA 18 0 

Qy 7 05 AATTAACGGGTTO^ATTCTGGCCGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCG 764 

II 1 1 1 1 1 1 1 1 1 1 I I I Ml II II 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III II I Mill 

Db 181 C^TCAACGGGTTCACTACCGGCCGCCGAGGTGACCTGGC(^C(^TT(^TGGCATGAACCG 24 0 

Qy 765 GCCCTTCCTGCTCCTCATGGCCACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCG 824 

III llllllll I II I II III Mill Mill I III II MIMM Mill llllllll 

Db 241 GCCTTTCCTGCTTCTCATGGCCACCCCGCTGGAGAGGGCCCAGCATCTGCAAAGCTCCCG 30 0 

Qy 825 GCACCGCCGAGCCCTGGATACCAACTACTGCTTCAGCTCCACGGACTACAAGGATGACGA 884 

1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 llllllll Mill II 1 1 Mill 

Db 301 GCACCGCCGAGCCCTGGACACCAACTATTGCTTCAGCTCCAC 342 

Qy 885 CGACAAGGAGAAGAACTGCTGCGTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGG 944 

I I I I I I I I I I I I I I I I I 1 I I I I I I I I I llllllll II llllllll II 
Db 343 GGAGAAGAACTGCTGCGTGCGGCAGCTGTACATTGACTTCCGCAAGGACCTCGG 3 96 

Qy 94 5 CTGGAAGTGGATTCATGAACCCAAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCC 1004 

MM MIMM I II II III MM I III II I II I II I llllllll llllllll II 

Db 3 97 CTGGAAGTGGATCCACGAGCCCAAGGGCTACCATGCCAACTTCTGCCTCGGGCCCTGCCC 4 56 

Qy 10 05 CTACATCTGGAGCCTAGACACTCAGTAC^GC^GGTCCTGGCTCTGTACAACC^G(^(^ 1064 

llllll llllllll Mill I III 1 1 il II 1 1 1 1 1 1 ! I I 1 1 1 1 1 1 1 1 1 1 ! II 

Db 4 57 CTACATTTGGAGCCTGGACACGCAGTACAGCAAGGTCC^ 516 

Qy 1065 CCCGGGCGCGTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGT 1124 

IIIMIMI 1 1 1 1 1 1 M 1 1 I II 1 1 1 II II 1 1 1 ! 1 1 M M 1 1 1 1 1 II M I 

Db 517 CCCGGGCGCCTCGGCGGCGCCGTGCTGCGTGCCGCAGGCGCTGGAGCCGCTGCCCATCGT 576 

Qy 112 5 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGCA.GCTGTCCAACATGATCGTGCGTTCCTG 1184 

M 1 1 1 1 1 1 1 1 1 1 1 1 1 M I M II I II 1 1 M I II 1 1 1 1 1 M I M I Ml 1 1 1 1 1 1 1 Mill 

Db 577 GTACTACGTGGGCCGCAAGCCCAAGGTGGAGC^GCTGTCCAACATGATCGTGCGCTCCTG 63 6 

Qy 118 5 CAAGTGCAGCTG 1196 

llllllll 

Db 637 CAAGTGCAGCTG 64 8 



RESULT 15 



AL530080/C 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



AL530080 841 bp mRNA linear EST 23-MAY-2003 

AL530080 Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALI ZED Homo sapiens 
cDNA clone CS0DD009YM06 3 -PRIME, mRNA sequence. 
AL530080 

AL53 008 0.2 GI : 31067915 
EST. 

Homo sapiens (human) 
Homo sapiens 
Eukaryota ; Metazoa ; 
Mammalia; Eutheria; 
1 (bases 1 to 841) 
Li,W.B. , Gruber,C. , 



FEATURES 

source 



Chordata; Craniata; Vertebrata; Euteleostomi ; 
Primates; Catarrhini; Hominidae; Homo. 



and Pol ayes, D. 



BASE COUNT 
ORIGIN 



Jessee, J. 

Full-length cDNA libraries and normalization 
Unpubl ished 

On Feb 13, 2001 this sequence version replaced gi : 12793573. 
Contact : Genoscope 

Genoscope - Centre National de Sequencage 
BP 191 91006 EVRY cedex - France 

Email: seqref@genoscope.cns.fr, Web : www.genoscope.cns.fr 

was normalized. Library was constructed by Life Technologies, a 

division of Invitrogen. This sequence belongs to sequence cluster 

9160. r For more information about this cluster, see 

http: //www. genoscope. ens . fr/ 

cgi -bin/cluster . cgi?seq=CS0DD00 9BG03NPl&cluster=9160 . r . Contact : 
Feng Liang Email : fliang@lifetech.com URL : 

http://fulllength.invitrogen.com/ InVitroGen Corporation 1600 
Faraday Avenue Genoscope sequence ID : CS0DD009BG03NP1 . 

Location/Qualifiers 

1. .841 

/organism="Homo sapiens" 
/ mo 1 _t yp e = " mRNA " 
/db_xref="taxon: 9606" 
/clone="CS0DD009YM06" 

/tissue_type= "NEUROBLASTOMA COT 50 -NORMALIZED" 
/clone_lib="Homo sapiens NEUROBLASTOMA COT 5 0 -NORMALIZED" 
/note="lst strand cDNA was primed with a NotI -ol igo (dT) 
primer. Five prime end enriched, double-strand cDNA was 
digested with Not I and cloned into the Not I and EcoR V 
sites of the pCMVSPORT 6 vector. Library was normalized." 
150 a 240 c 283 g 153 t 15 others 



Query Match 43.5%; Score 520.2; DB 9; Length 841; 

Best Local Similarity 84.0%; Pred. No. 4e-107; 

Matches 647; Conservative 7; Mismatches 86; Indels 30; Gaps 



5; 



QY 
Db 

Qy 

Db 

Qy 



428 CGGTGCCGGAACCTGTATTGCTCT - CTCGGGCAGAGCTGCGCCTGCTGAGGCTCAAGTTA 4 86 

1 1 1 1 II hill I I- I IIIIIIIIIIMI Mill I II M 

841 CGGTACCTGMACCCTTGTTGYTCTCCCCGGGCAGAGCTGCTTCTGCTAAGGCTCAAGTTA 782 

4 87 AAAGTGGAGCAGCACGTGGAGCTATACC^GAAATACAGCAATGATTCCTGGCGCTACCTC 546 

11:1 llllll'IM III lllllllllllllllll hlllllll llllll 
781 AAAGTGGAGC^GCACGTGGAGCTGTACC^GAAATACAGCAACAAW 722 

547 AGCAACCGGCTGCTGGCCCCCAGTGACTCACCGGAGTGGCTGTCCTTTGATGTCACCGGA 606 

llllllhll MINI Mill Mill II UN I II MM MINI 



Db 721 AGCAACCSGCSGCTGGCACCCAGCSACTCGCCAGAGTGGTTTTCTTTTGTTGTCACCGKA 662 

Qy 6 07 GTTGTGCGGCAGTGGCTGACCCGCAGAGAGGCTATAGAGGGTTTTCGCCTCAGTGCCCAC 666 

lllllllll Mil III Ml III II II III II II II II III II 

Db 661 GTTGTGCGGCAGTGGTTGAGCCGTGGAGGGGAAATTAAGG- -TTTCGCCTTAGCGCCNAC 604 

Qy 667 TCTTCCTCTGACAGCAAAGATAACACACTCCACGTGGAAATTAACGGGTTCAATTCTGGC 72 6 

I I I I I MINI M MM II Ml I II I M M I II III 

Db 6 03 GGCTCCTGTAACAGCAGGGATAACACACTGCAAGTGAACATCAACGGGTTAACTACCGGC 544 

Qy 727 CGCCGGGGTGACCTGGCCACCATTCACGGCATGAACCGGCCCTTCCTGCTCCTCATGGCC 78 6 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IMMMMMMI I M M M I lllllllll 

Db 543 CGCCGAGGTGACCTGGCCACCATTCTTGGCATGAACCGGCCTTTCCTGCTTCTCATGGCC 4 84 

Qy 787 ACCCCGCTGGAGAGGGCCCAGCACCTGCACAGCTCCCGGCACCGCCGAGCCCTGGATACC 84 6 

1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

Db 4 83 ACCCCGCTGGAGAGGGCCCAGCTTCTGCAAAGCTCCCGGCACCGCCGAGCCCTGGACACC 424 

Qy 847 AACTACTGCTTCAGCTCCACGGACTACAAGGATGACGACGACAAGGAGAAGAACTGCTGC 906 

III I lllllllll II II Ml MINIM 

Db 423 AACTATTGCTTCAGCTCCACGG AGAAGAACTGCTGC 388 

Qy 907 GTGCGGCAGCTCTACATTGACTTCCGGAAGGACCTGGGCTGGAAGTGGATTCATGAACCC 966 

lllllllllll MUM MM IMIIIII MUM MM II II III 

Db 387 GTGCGGCAGCTGTACATTAACTTCCGCAAGGACCTCGGCTGGGAGTGGATCCACGAGCCC 32 8 

Qy 967 AAGGGCTACCATGCCAATTTCTGCCTGGGGCCCTGTCCCTACATCTGGAGCCTAGACACT 1026 

1 1 1 II M 1 1 1 1 i 1 1 I II 1 1 II M IMIIIII IMIIIII IMIIIII Mill 

Db 327 AAGGGCTWCCATGCCAACTTCTGCCTCGGGCCCTGCCCCTACATTTGGAGCCTGGACACG 2 68 

Qy 1027 CAGTACAGCAAGGTCCTGGCTCTGTACAACCAGCACAACCCGGGCGCGTCGGCGGCGCCG 1086 

III IMIIIIIMIMIII II MIMMII Ml II 

Db 2 67 CAGGACAGCAAGGTCCTGGCCCTGTACAACCAGCATAACCCGG- -GCCTCGGCGGCGCCG 210 

Qy 1087 TGCTGCGTGCCGCAGGCGCTGGAGCCACTGCCCATCGTGTACTACGTGGGCCGCAAGCCC 114 6 

III II 1 1 M MM 1 1 1 1 1 1 M I M MMM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 209 TGG-GCGTGCCGCAGGCGCTGGAGCCGCTGCCCTTCGTGTACTACGTGGGCCGCAAGCCC 151 

Qy 114 7 AAGGTGGAG CAG CTGTCCAACATGAT CGTGCGTT C CTG CAAGTG CAG CTG 1196 

MMM II I II I WWW lllllllll II MM 

Db 15 0 AAGGTGGGGCGGCTGTCCAACATGGTCGTGCGCTCCTGCAAGGGCCGCTG 101 



Search completed: October 28, 2003, 00:08:13 
Job time : 2974.42 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



October 27, 2003, 19:11:58 ; Search time 33.2026 Seconds 

(without alignments) 
1902.657 Million cell updates/sec 

US-10-017-372E-13 
2114 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNMI VRSCKCS 3 98 

BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1107863 seqs, 158726573 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1107863 



Database : 



A_Geneseq_19Jun03 : ' 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 



/SIDSl/gcgdata/geneseq/geneseqp- 
/S I DSl/gcgdata /geneseq/geneseqp - 
/SIDSl/gcgdata/geneseq/geneseqp- 
/S I DS 1 / gcgda t a/geneseq/geneseqp- 
/S IDS 1 /gcgda t a/geneseq/geneseqp - 
/S IDS 1 /gcgda t a/geneseq/geneseqp- 
/S IDS 1 /gcgda ta/geneseq/geneseqp- 
/SIDSl/gcgdata/geneseq/geneseqp- 
/S I DSl/gcgdata /geneseq/geneseqp - 
/ SI DS1 /gcgda ta/geneseq/geneseqp 
/SI DSl/gcgdata /geneseq/geneseqp 
/ S I DS 1 /gcgda t a /geneseq/ geneseqp 
/SI DSl/gcgdata /geneseq/geneseqp 
/SI DSl/gcgdata /geneseq/geneseqp 
/ SIDSl/gcgdata/geneseq/geneseqp 
/ SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 
/S I DS 1 / gcgdata/geneseq/geneseqp 
/SIDSl/gcgdata/geneseq/geneseqp 



embl/AA1980. 
embl/AA1981 . 
embl/AA1982 . 
embl/AA1983 . 
embl/AA1984 . 
embl/AA1985. 
embl/AA1986. 
embl/AA1987. 
embl/AA1988 . 
-embl/AA1989 
-embl/AA1990 
-embl/AA1991 
-embl/AA1992 
-embl/AA1993 
-embl/AA1994 
-embl/AA1995 
-embl/AA1996 
-embl/AA1997 
-embl/AA1998 
-embl/AA1999 
-embl/AA2000 
-embl/AA2001 
-embl/AA2002 
-embl/AA2003 



DAT- * 
DAT 
DAT 
DAT 
DAT 
DAT 
DAT 
DAT 
DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 
.DAT 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 



and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 



No . 
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ALIGNMENTS 



RESULT 1 
AAE13596 

ID AAE13596 standard; Protein; 390 AA. 
XX 

AC AAE13596; 
XX 

DT 26-FEB-2002 (first entry) 
XX 

DE Porcine transforming growth factor beta 1 (TGF-betal) mutant. 
XX 

KW Porcine; transforming growth factor beta 1; TGF-betal; gene therapy; 

KW IBD; inflammatory bowel disease; autoimmune disease; immunosuppressive; 

KW multiple sclerosis; rheumatoid arthritis; systemic lupus erythematosus; 

KW diabetes mellitus; sarcoidosis; psoriasis; dermatological ; mutant; 

KW mutein. 
XX 

OS Sus scrofa. 
XX 

FH Key Location/Qualif iers 

FT Misc-dif ference 223 

FT /note= "Wild type Cys substituted with Ser" 

FT Misc-dif ference 225 

FT /note= "Wild type Cys substituted with Ser" 

XX 

PN WO200181404-A2 . 
XX 

PD 01-NOV-2001. 
XX 

PF 20-APR-2001; 2001WO-US12980 . 
XX 

PR 20-APR-2000; 2 000US - 19 9 0 14 P . 
XX 

PA (USSH ) US DEPT HEALTH & HUMAN SERVICES. 
XX 

PI Strober W, Nakamura K, Kitani A, Fuss IJ; 
XX 

DR WPI; 2002-026155/03. 

DR N-PSDB; AAD22696. 
XX 

PT Composition for treating autoimmune diseases e.g. inflammatory bowel 

PT disease in humans, comprises vector containing transforming growth 

PT factor-beta under the control of inducible promoter 
XX 

PS Example 1; Fig 1; 78pp; English. 
XX 

CC The invention relates to a composition containing a vector comprising a 

CC gene encoding a regulatory transcription factor under the control of a 

CC promoter encoding a transforming growth factor-beta (TGF-beta) . The 

CC vector is useful for expressing TGF-beta, such as TGF-betal, TGF-beta2 

CC or TGF-beta3, its variants or homologues, by transfecting a cell which 

CC is part of a host suspected of having an autoimmune disease, especially 

CC inflammatory bowel disease (IBD) , under conditions such that the 

CC polypeptide encoded by the nucleic acid sequence in the vector is 

CC expressed. The vector is delivered using a delivery system. The delivery 

CC of the vector results in substantial elimination of symptoms of the 

CC autoimmune disease and increased production of IL-10 by the host. The 

CC composition is useful for treating various diseases with an autoimmune 



CC component such as multiple sclerosis, rheumatoid arthritis, systemic 

CC lupus erythematosus, insulin-dependent diabetes mellitus, sarcoidosis 

CC and psoriasis, and also for assaying the expression of a gene in a cell- 

CC The vector is further useful for screening of the effect of test 

CC compounds on cytokine (e.g. TGF-beta) expression of transfected cells. 

CC The present sequence is porcine TGF-betal mutant. 

XX 

SQ Sequence 3 90 AA; 

Query Match 97.1%; Score 2053; DB 23; Length 390; 

Best Local Similarity 98.0%; Pred. No. 1.5e-173; 

Matches 390; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllll IMIII'IIIMIIIM Ml I! Ill li MM II III! II III I Mill III 

Db 1 MA PSGLRLLPLLLPLLWLLVLT PGR PAAGLSTCKT I DMELVKR ICR I EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

III MM Ml III MM IMM Mill II Ml III MM III III III II I I II III 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I M M M M M M M M M I M M M M M M I 1 1 1 1 1 1 1 1 M M M M M M M M M I 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Qy 181 YLSNRLLAPSDS PEWLS FDVTG WRQWLTRREA I EGFRLSAHS S SDS KDNTLHVE I NGFN 24 0 

i MIMM III III MM Mill IMM II III Ml IMM IIMM III II Mill III 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

Qy 241 SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

M M M M M M M M M M I M M M M I M 1 1 M M M M M M M I Ml 

Db 241 SGRRGDLATI HGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

M I M I M I M I M M I M 1 1 1 M M M 1 1 M 1 1 1 1 M I M M 1 1 1 1 1 1 1 1 1 1 M 1 1 M I 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

MIMIII Ml III III Mill Mill II III Ml II 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 2 
AAM39186 

ID AAM39186 standard; Protein; 390 AA. 
XX 

AC AAM39186; 
XX 

DT 22-OCT-2001 (first entry) 
XX 

DE Human polypeptide SEQ ID NO 2331. 
XX 

KW Human; nootropic; immunosuppressant; cytostatic; gene therapy; cancer; 

KW peripheral nervous system; neuropathy; central nervous system; CNS; 

KW Alzheimer's; Parkinson's disease; Huntington's disease; haemostatic; 

KW amyotrophic lateral sclerosis; Shy-Drager Syndrome; chemotactic; 

KW chemokinetic; thrombolytic; drug screening; arthritis; inflammation; 



KW leukaemia. 
XX 

OS Homo sapiens. 
XX 

PN WO200153312-A1 . 
XX 

PD 26-JUL-2001. 
XX 

PF 26-DEC-2000; 2 000WO-US34263 . 
XX 

PR 21-JAN-2000; 2000US-0488725 . 

PR 25-APR-2000; 2 000US- 05523 17 . 

PR 09-JUL-2000; 2 000US- 0598042 . 

PR 19-JUL-2000; 2 000US- 062 03 12 . 

PR 03-AUG-2000; 2 000US- 0653450 . 

PR 14-SEP-2000; 2 000US - 0662 1 9 1 . 

PR 19-OCT-2000; 2000US-0693036 . 

PR 29-NOV-2000; 2 000US - 072 7344 . 
XX 

PA (HYSE-) HYSEQ INC. 
XX 

PI Tang YT, Liu C, Asundi V, Chen R, Ma Y, Qian XB, Ren F, Wang D; 

PI Wang J, Wang Z, Wehrman T, Xu C ( Xue AJ, Yang Y, Zhang J; 

PI Zhao QA, Zhou P, Goodrich R, Drmanac RT; 
XX 

DR WPI; 2001-442253/47. 

DR N-PSDB; AAI58342. 
XX 

PT Novel nucleic acids and polypeptides, useful for treating disorders 

PT such as central nervous system injuries - 

XX 

PS Example 4; SEQ ID NO 2331; 10078pp ; English. 
XX 

CC The invention relates to human nucleic acids (AAI57798-AAI61369) and 

CC the encoded polypeptides (AAM38642-AAM42213 ) with nootropic, 

CC immunosuppressant and cytostatic activity. The polynucleotides are useful 

CC in gene therapy. A composition containing a polypeptide or polynucleotide 

CC of the invention may be used to treat diseases of the peripheral nervous 

CC system, such as peripheral nervous injuries, peripheral neuropathy and 

CC localised neuropathies and central nervous system diseases, such as 

CC Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 

CC lateral sclerosis, and Shy-Drager Syndrome. Other uses include the 

CC utilisation of the activities such as: Immune system suppression, 

CC Act ivin/inhibin activity, chemotactic/chemokinetic activity, haemostatic 

CC and thrombolytic activity, cancer diagnosis and therapy, drug screening, 

CC assays for receptor activity, arthritis and inflammation, leukaemias and 

CC C.N.S disorders. 

CC Note: The sequence data for this patent did not form part of the printed 

CC specification. 

XX 

SQ Sequence 3 90 AA; 

Query Match 91.3%; Score 1931; DB 22; Length 390; 

Best Local Similarity 92.0%; Pred. No. 9.9e-163; 

Matches 366; Conservative 10; Mismatches 14; Indels 8; Gaps 1; 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 



Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


293 


Qy 


361 


Db 


353 



I IIIIIIIIMIII IMIMI IMIIIIMIM MMMMMMMMMMM 

MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVlaMVESGNQI 120 

M 1 1 1 M M 1 1 1 MM 1 1 1 1 1 II M 1 1 M M ! 1 1 III M M M I III II II M hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 
YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

MMI = MM M M M M I MM Ml M I M III III 1 1 1 II II I II I II M I I 

YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 
YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

Mill MUM I Mill MINI 

YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 
SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

M M 1 1 II I II 1 1 M 1 1 1 M 1 1 1 1 IM Ml lllllllllllllllll III 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 2 92 

CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

I I IM 1 1 Ml II II III I M 1 1 1 1 1 II I III II 1 1 1 II I II 1 1 1 II I M 1 1 1 1 1 1 1 1 1 I 

CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 52 



APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 

I M 1 1 II II 1 1 1 MM I M I M 1 1 II 1 1 II 1 1 1 li M 

APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 



398 



390 



AAP61468; 

31-OCT-2 002 (updated) 
28-OCT-1991 (first entry) 

PreTGF-beta gene product. 

Transforming growth factor beta; cancer; wound healing. 
Unidentified. 



RESULT 3 
AAP61468 

ID AAP61468 standard; Protein; 390 AA. 
XX 
AC 
XX 
DT 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
XX 
PA 
XX 



Key 

Protein 
EP200341-A. 
10-DEC-1986 . 

21- MAR-1986; 

22- MAR-1985; 
13-MAR-1987; 



Location/Qualif iers 
279. .390 



86EP-0302112 . 

85US-0715142 . 
87US-0025423 . 



(GETH ) GENENTECH INC. 



PI 

XX 
DR 
DR 
XX 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 
cc 
cc 

XX 
SQ 



Derynck RMA; 

WPI; 1986-326875/50. 
N-PSDB; AAN60972. 

TGF-beta prodn. from transformed hosts 
wounds (J6 2/9/86) . 

Disclosure; Fig lb; 26pp; English. 



useful esp. for treating 



The gene product is known to stimulate cell proliferation and 
inhibit anchorage-dependent growth of a variety of human cancer cell 
lines, it is esp. useful in treatment of burns and the promotion of 
surface and internal wound healing. TGF-beta may be expressed from a 
transformed CHO cell line. 

(Updated on 31-OCT-2002 to add missing OS field.) 

Sequence 3 90 AA; 

Query Match 91.0%; Score 1924; DB 7; Length 390; 

Best Local Similarity 91.7%; Pred. No. 4.1e-162; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 



1; 



Qy 


1 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


293 


Qy 


361 


Db 


353 



MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 60 

I I II MM II II M III 1 1 1 II I 1 1 II 1 1 1 1 1 1 1 ! II 1 1 1 1 1 1 1 1 1 II ,1 1 1 1 1 1 1 1 1 

MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLA 60 
SPPSQGDVPPGPLPEAVlALYNSTRDRVAGESVEPEPEPEADYYAKEvTRVLMVESGNQI 12 0 

I I I I MM I M II M I I I I II II II M III I I I I I il 1 , 1 1 II 1 1 1 1 II 1 1 Ih hi 

SPPSQGEVPPGPLPEIAVTiALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVljMVETHNEI 120 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I I I I I : 11 = 1! MIMIMIIIIIIIIIII III Ml MUM MMIM III Mill 
YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

II I Mill 1 1 II 1 1 II III III II II I hi II 1 1 II III I I hi II I hi II I 

YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 
SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

IMIIIIIIIIIIIIIIIIIMII 1 1 1 1 1 1 1 h 1 1 1 1 1 1 III 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 2 92 

CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 360 

I h I I I I I I I I I I I I h I I I I I I I I I II II I I I I I I II I I I I II I I I I I: I I II I II I I 
CCWQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPY I WSLDTQYSKVLALYNQHNPGASA 352 



APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 

MMIM IMM I Mill I MIIIIIIIIMI III 1 1 

APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 



398 



390 



RESULT 4 
AAR04034 

ID AAR04034 standard; protein; 390 AA. 
XX 

AC AAR04034; 



XX 
DT 
DT 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
FH 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



2 5-MAR-2 003 (updated) 
31-OCT-2 002 (updated) 
31-MAY-1989 (first entry) 

Sequence of pre-TGF-beta 1. 

Transforming growth factor beta-3 (TGF beta 3) ; tumour cells; growth 
inhibition. 



Homo sapiens. 

Key 
CDS 

WO8912101-A. 
14-DEC-1989 . 
08-JUN-1988; 
08-JUN-1988; 



Locat ion/Qua 1 i f iers 
348. .500 



88WO-US01945 . 
88WO-US01945. 



(GETH ) GENENTECH INC. 

Dernyck RMA, Goeddel DV; 

WPI ; 1990-007474/01 . 
N-PSDB; AAQ02815. 

Nucleotide sequence encoding transforming growth factor beta-3 used as a 
probe, or to produce TGF beta 3, for inhibiting growth of certain normal 
and neoplastic cells, eg A549. 

Disclosure; Fig. 2; 61pp; English. 

Sequence is an exon of transforming growth factor-beta 1 (pre-TGF-beta 
1) polypeptide and corresponds to AA's 288-338 of mature TGF-beta 1. The 
nucleic acid encoding second subtype of TGF-beta (TGF-beta 3) is useful 
as a probe or to produce TGF-beta 3 for inhibition of normal and 
neoplastic cell growth. 

(Updated on 31-OCT-2002 to add missing OS field.) 
(Updated on 25-MAR-2003 to correct PR field.) 
(Updated on 25-MAR-2003 to correct PI field.) 



Sequence 3 90 AA; 

Query Match 91.0%; Score 1924; DB 11 

Best Local Similarity 91.7%; Pred. No. 4.1e-162 
Matches 365; Conservative 10; Mismatches 15 



Length 3 90; 

Indels 8 ; Gaps 1; 



Qy 



Db 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I llllllllllllllllllllll II III III III I'l Mill III II MINIM III 

1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 



61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

: 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 MMI Ml 1 1 1 II Ml 1 1 M hi 



120 



Db 



61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll Mill Mill Ml II 1 1 Mill Ml IIMIIi II III I 1 ! I--III 

Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

M III MM MUM M IM Mill I IIMIII MINI 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKX)DDDKEKN 3 00 

M MMM MMIIIMM III II III III I 1 1 1 1 III 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYI DFRKDLGWKWI HE PKGYHANFCLGPCPY I WSLDTQYSKVLALYNQHNPGAS A 360 

II MM II IMIMIMM III II III III I Ml Ml III IIMIII II IM IM Mill 

Db 2 93 CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 52 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IMIMM MMIIIMM III II III II 1 1 IM Ml 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 5 




AAR05258 




ID 


AAR05258 standard; protein; 390 AA. 


XX 






AC 


AAR05258; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


05-AUG-1990 


(first entry) 


XX 






DE 


Human pre-transf orming growth f actor-beta-1 (pre-TGF-beta- 1 


XX 






KW 


Transforming < 


growth factor-beta- 1 (TGF-beta-1) ; 


KW 


neoplastic cell line inhibition; 


KW 


EGF-potentiated anchorage -independent growth. 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qualif iers 


FT 


Peptide 


1 . .278 


FT 


Protein 


279. .2011 


FT 


Domain 


8 . .23 


FT 




/note= "hydrophobic domain" 


FT 


Modif ied-site 


82. .84 


FT 




/note="potential N-glycosylation site 11 


FT 


Modif ied-site 


136 . . 138 


FT 




/note="as above" 


FT 


Modif ied-site 


176 . . 178 


FT 




/note="as above" 


FT 


Cleavage-site 


277 . . 278 


FT 




/note="proteolytic cleavage site" 


XX 






PN 


US4886747-A. 




XX 






PD 


12-DEC-1989. 





XX 

PF 13-MAR-1987; 8 7US - 002 5423 . 
XX 

PR 13-MAR-1987; 87US-0025423 . 

PR 22-MAR-1985; 85US-0715142 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1990-051338/07. 

DR N-PSDB; AAQ93301. 
XX 

PT Nucleic acid encoding transforming growth factor-beta - 

PT cloned into expression vectors for expression in eukaryotic host 

PT cells for therapeutic use 

XX 

PS Disclosure; Fig lb; 28pp; English. 
XX 

CC Pre-TGF-beta-1 is a polypeptide of 390 amino acids. Post-transitional 

CC cleavage of the precursor gives rise to the mature TGF-beta monomer. 

CC The sequence for human TGF-beta was determined by direct amino acid 

CC sequence analysis and by deduction from the TGF-beta cDNA . It is 

CC capable of inducing EGF-potent iated anchorage- independent growth of 

CC target cell lines, and/or growth inhibition of neoplastic cell lines. It 

CC can be used for treating wounds, eg burns or epidermal ulcers. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 3 90 AA; 

Query Match 91.0%; Score 1924; DB 11; Length 390; 

Best Local Similarity 91.7%; Pred. No. 4.1e-162; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 !: I I i I ! 1 1 1 1 1 1 1 i II 1 1 1 1 1 1 1 1 I !l I M M I II 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

II 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II ! I M 1 1 1 1 lllllllllllllllllllllh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVTjLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill = Ihll I I I I I I I I II i I M I I I I I I I I I I I I II I I I I II I I 11 I I I: I II 
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I IM IIMIIIII I hill hill 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

= I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I III 
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 3 01 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 293 CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 



Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

1 1 1 M Ml I U M 1 1 1. 1 N I ! 1 1 1 1 1 II Ml I M'l 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



AAR13813; 

20-NOV-1991 (first entry) 
Human pro-TGF-beta 1. 
Osteogenetic; tumoricidal . 
Homo sapiens. 



Key 

Peptide 
Peptide 
Peptide 

JP03180192-A. 

06- AUG-1991 . 

07- DEC-1989; 
07-DEC-1989; 



Location/Qualifiers 
1. .29 

/note= "signal peptide" 
30. .390 

/note= "pro-TGF-beta 1" 
279. .390 

/note= "TGF-beta 1" 



RESULT 6 
AAR13813 

ID AAR13813 standard; Protein; 390 AA. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
FH 
FT 
FT 
FT 
FT 
FT 
FT 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
DR 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 

XX 
SQ 



89JP-0318243. 
89JP-0318243 . 



(KIRI ) KIRIN BREWERY KK. 

WPI ; 1991-271579/37. 
N-PSDB; AAQ13392. 

Human pro-TGF-beta 1 prodn., for osteo-genet ic activity - by 
preparing DNA chain contg. base sequence coding for human 
pre: pro-TGF-beta 1, forming expression vector etc. 

Claim 1; Fig 1; 16pp; Japanese. 

The amino acid sequence codes for human prepro-TGF-beta 1 which 
can be produced by recombinant methods, it has osteogenetic and 
tumoricidal activity. 

Sequence 3 90 AA; 



Query Match 91.0%; Score 1924; DB 12 

Best Local Similarity 91.7%; Pred. No. 4.1e-162 
Matches 365; Conservative 10; Mismatches 15 



Length 3 90; 

Indels 8; Gaps 



1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I 1 1 1 1 Ml 1 1 M 1 1 III I II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml II 1 1 1 1 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

1 1 1 1 1 h II 1 1 1 1 1 1 Ml 1 1 II M 1 1 ! 1 1 1 1 1 II M 1 1 i 1 1 1 1 1 1 1 1 1 M I M hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

MIM = MMI 1 1 1 II 1 1 II II 1 1 : 1 1 M II 1 1 1 M I M M 1 1 1 1 1 1 1 II I II M II 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

M il 1 1 1 III 1 1 Ml 1 1 II 1 1 1 1 II 1 1 Ml MMMMI I Mil MMI 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

M 1 1 1 Ml 1 1 1 II 1 1 Ml I II I II 1 1 1 1 1 1 MIMM Mill" Ml 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 3 01 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 60 

I Ml 1 1 II I MM II 1 1 1 1 1 II I M 1 1 II I II 1 1 II 1 1 MM 1 1 1 II I M 1 1 M I II I II 

Db 293 CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

I Ml 1 1 II I Ml 1 1 1 1 II 1 1 II MM I II I ! II I II 1 1 

Db 353 APCGVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 390 



RESULT 7 




AAR73596 




ID 


AAR73596 standard; Protein; 


XX 






AC 


AAR73596; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


20-DEC-1995 


(first entry) 


XX 






DE 


Human TGF-beta 1 protein. 


XX 






KW 


Transforming growth factor-t 


KW 


TGF-beta 3; osteogenic cell 


KW 


bone- inducing 


cof actor . 


XX 






OS 


Homo sapiens. 




XX 






PN 


US5409896-A. 




XX 






PD 


25-APR-1995. 




XX 






PF 


12-NOV-1993; 


93US-0132405. 


XX 






PR 


01-SEP-1989; 


89US-0401906. 


PR 


12-NOV-1991; 


91US-0790856. 


PR 


18-MAY-1993; 


93US-0063841. 


PR 


12-NOV-1993; 


93US-0132405. 


XX 







PA (GETH ) GENENTECH INC. 
XX 

PI Ammann AJ, Rudman CG; 
XX 

DR WPI; 1995-169610/22. 
XX 

PT Compsn. for treating skeletal tissue deficiency - comprising 

PT transforming growth factor-beta and an osteogenic cell source in a 

PT carrier 

XX 

PS Claim 3; Column 15-18; 19pp; English. 
XX 

CC This sequence represents human transforming growth factor-beta 1 

CC (TGF-beta 1) . This sequence and the sequence for human TGF-beta 3 (see 

CC AAR73598) are claimed within the scope of the invention. The invention 

CC is a composition consisting of a TGF-beta protein and an osteogenic cell 

CC source (OCS) formulated in an acceptable carrier other than a bone 

CC morphogenic cof actor. This composition can be used for the restoration 

CC of bone deficiency. This provides for the generation of mature bone 

CC only where it is required, without the inclusion of a specific 

CC bone-inducing cofactor. This method can be used with any of the 5 human 

CC TGF-beta' s or with TGF-beta from other species. 

CC (Updated on 25-MAR-2003 to correct PF field.) 

XX 

SQ Sequence 3 90 AA; 

Query Match 91.0%; Score 1924; DB 16; Length 390; 

Best Local Similarity 91.7%; Pred. No. 4.1e-162; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

i iiiiiiiiiiiiiiiiiiiiii 1 1 s 1 1 1 1 1 1 , m 1 1 1 1 1 1 1 1 , 1 1 ; ; 1 m i 1 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

sir:, lllllllllillllllllllll: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll I I I I I I II I I I I I I I I I I I I I I Ml I I I I II I I ! I I I I I I I h I II 
Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVXiLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

IIIMIIIIIIIIIIIIIIIIIIIIIIhl lllllllll I Ihllll hllll 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

M I II 1 1 1 M 1 1 i 1 1 II I : I M 1 1 1 1 1 M I i 1, 1 1 1 Ih II I MM 1 1 III 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IE 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 M 1 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 

Db 2 93 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 52 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

II M M 1 1 M M . II I II I II 1 1 M M M I M 1 1 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 8 
AAR90827 

ID AAR90827 standard; Protein; 390 AA. 
XX 

AC AAR90827; 
XX 

DT 25-MAR-2003 (updated) 

DT 25-JAN-1980 (first entry) 

XX 

DE Pre-transf orming growth factor beta 1. 
XX 

KW transforming growth factor beta 1; wound healing; 

KW recombinant production. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualif iers 

FT Domain 8 . . 23 

FT /note= "hydrophobic domain" 

FT Modif ied-site 82 . . 84 

FT /note= "potential N-glycosylation site" 

FT Modif ied-site 136.. 138 

FT /note= "potential N-glycosylation site" 

FT Modif ied-site 176.. 178 

FT /note= "potential N-glycosylation site" 

FT Cleavage-site 277.. 279 

FT /note= " tryps in- 1 ike peptidase cleavage site" 

FT Protein 279 . .390 

FT /label= mature_TGF_beta_l 

XX 

PN US5482851-A. 
XX 

PD 09-JAN-1996. 
XX 

PF 05-NOV-1993; 93US - 014 73 64 . 
XX 

PR 13-MAR-1987; 8 7US - 0025423 . 
PR 22-MAR-1985; 85US-0715142 . 
PR 04-AUG-1989; 89US-0389929 . 
PR 04-MAR-1992; 92US - 084 58 93 . 
PR 05-NOV-1993; 93US-0147364 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1996-076891/08. 
DR N-PSDB; AAT15720. 
XX 

PT New recombinant human transforming growth factor-beta prods. - produced 
PT using Chinese hamster ovary cells, for use in diagnostic applications 
PT or in therapy 
XX 

PS Example 3; Fig 1A-C; 26pp; English. 
XX 



CC The pre- trans forming growth factor (TGF) beta 1 protein is encoded 

CC by AAT15720. The mature TGF beta 1 monomer is cleaved from the 

CC precursor at the Arg-Arg dipeptide immediately preceding the mature 

CC TGF-beta 1 NH2 -terminus . It does not contain a recognisable N-terminal 

CC signal peptide typical of most secreted proteins. The pre -TGF beta 1 

CC contains several pairs of basic residues which could undergo 

CC post-translational cleavage and give rise to separate polypeptide 

CC entities. The precursor contains 3 potential N-glycosylation sites, none 

CC of which are localised in the mature TGF beta 1. This is useful in 

CC purification of the mature protein. TGF beta 1 can be used in, e.g. wound 

CC healing. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 3 90 AA; 

Query Match 91.0%; Score 1924; DB 17; Length 390; 

Best Local Similarity 91.7%; Pred. No. 4.1e-162; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1! 1 1 II 1 1 I I'l 1 1 1 1 1 1 M II I II ! 1 1 II Ml 1 1 1 1 1 1 1 1 1 : 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllhlllllllMIIIIIIIIIIIIIIII lllllllllllllllllllllh hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : Ihll I I I I I 1 I I I I I I I I I I I I I I I I ! I I I I I I I I I 1 I I I I I I I I I : I I I 
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDS PEWLS FD VTGWRQWLTRREA I EGFRLSAHSS SDS KDNTLHVE I NGFN 24 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I lllllllll I Ihllll hllll 
Db 181 YLSNRLLAPSDS PEWLSFDVTG WRQWLSRGGE I EGFRLSAHCS CDSRDNTLQVD I NGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

MM III 1 1 1 1 1 1 1 1 1 1 1 M I 1 1 1 1 1 M 1 1 IIMIMM Mill Ml 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 2 92 

Qy 3 01 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

1 1 1 M 1 1 1 1 M I II II II M Ml I 1 1 I II I II MM II 1 1 1 1 1 1 Ml II II I II I II 1 1 

Db 2 93 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 9 
AAU77101 

ID AAU77101 standard; Protein; 390 AA. 
XX 

AC AAU77101; 
XX 

DT 05-JUN-2002 (first entry) 
XX 

DE Human transforming growth factor beta 1 (TFG-beta-1) polypeptide. 



XX 

KW Human; transforming growth factor beta; TGF-beta; insulin production; 

KW type I diabetes mellitus; pancreatic cell outgrowth; wound healing; 

KW pancreatic duct tissue; ischaemia; stroke; nervous system aging; 

KW neurological condition; neurodegenerative disease; inflammation; 

KW vasal injury; chemical injury; traumatic injury; tumour- induced injury; 

KW amyotrophic lateral sclerosis; spinocerebellar degeneration; 

KW immunological disease; multiple sclerosis; TGF-beta- 1. 

XX 

OS Homo sapiens. 
XX 

PN WO200212336-A2 . 
XX 

PD 14-FEB-2002. 
XX 

PF 09-FEB-2001; 2 00 1WO-US04 192 . 
XX 

PR 09-AUG-2000; 2 00 OUS - 063 5368 . 
XX 

PA (CURI-) CURIS INC. 
XX 

PI Wang M, Pang K; 
XX 

DR WPI; 2002-257468/30. 
XX 

PT Treating a subject with a disorder resulting from insufficient insulin 

PT production, and inducing outgrowth of pancreatic cells, involves using 

PT a transforming growth factor beta therapeutic 
XX 

PS Disclosure; Fig 1; 77pp ; English. 
XX 

CC The invention relates to treating a subject with a disorder resulting 

CC from insufficient insulin production, involving contacting the subject 

CC with a transforming growth factor beta (TGF-beta) therapeutic. TGF-beta 

CC polypeptides can be used for treating a subject with a disorder resulting 

CC from insufficient insulin production, e.g. type I diabetes mellitus, and 

CC for inducing outgrowth of pancreatic cells associated with pancreatic 

CC duct tissue within a subject. A composition comprising a TGF-beta protein 

CC may be useful in wound healing and treatment of neurological conditions 

CC derived from acute, subacute or chronic injury to the nervous system, 

CC including traumatic injury, chemical injury, vasal injury and deficits 

CC (such as ischaemia resulting from stroke) , together with 

CC infectious/inflammatory and tumour- induced injury, aging of the nervous 

CC system including Alzheimer's disease, chronic neurodegenerative diseases 

CC including Parkinson's disease, Huntington's chorea, amyotrophic lateral 

CC sclerosis, spinocerebellar degenerations and chronic immunological 

CC diseases of the nervous system or affecting the nervous system, including 

CC multiple sclerosis. This sequence represents the human TGF-beta-1 

CC protein. 

XX 

SQ Sequence 3 90 AA; 



Query Match 90.9%; Score 1921; DB 23 

Best Local Similarity 91.7%; Pred. No. 7.6e-162 
Matches 365; Conservative 10; Mismatches 15 



Length 3 90; 

Indels 8; Gaps 1; 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 



I lllllll I'lM! Illlllllll illl IIIM'MIII IIIIIIIIIIIMI 

1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 



Db 


1 


MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 


60 


Ov 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 


120 






lllllhlllllllllllllllllllllllll llllllllllllllllllllh hi 




Db 


61 


SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 


120 


Ov 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 


180 






Mill : Ihll IMMIMI M 1 I! Ill Ml III II II Ml Ml 




Db 


121 


YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 


180 


Qy 


181 


YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 


240 






MMMMIIIIIMIIIIIIIIMIIMI IMMMM 1 Ml MM 




Db 


181 


YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 


240 


Ov 


241 


SGRRGDLATIHGMNRPFLLL^4ATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 


300 






• 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 lllllllllllllllll III 

• 1 1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllllllllll III 




Db 


241 


TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 


292 


Qy 


301 


CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 


360 






i M : 1 1 1 1 1 ' 1 1 1 1 1 1 1 1 . ii mm i ; 1 1 1 1 1 ; 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 m i m 1 1 1 1 




Db 


293 


CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 


352 


Qy 


361 


APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 








MIMM MUM 1 1 II 1 MIMM Mill 




Db 


353 


APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 





RESULT 10 
AAE16943 

ID AAE16943 standard; Protein; 390 AA. 
XX 

AC AAE16943; 
XX 

DT 18-APR-2002 (first entry) 
XX 

DE Human transforming growth factor-betal (TGF-betal) protein. 
XX 

KW Human; transforming growth factor-betal; TGF-betal; osteoporosis; 

KW latency associated peptide; LAP; integrin alphavbeta3; apoptosis; 

KW immunomodulation; inflammatory disease; fibrotic disease; cancer; 

KW diabetic retinopathy; chronic obstructive pulmonary disorder; 

KW bone resorption; rheumatoid arthritis; psoriasis; restenosis; 

KW atherosclerosis; liver fibrosis; asthma; cytostatic; osteopathic; 

KW ophthalmological ; antiarteriosclerotic ; vasotropic . 
XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT Peptide 1 . .29 

FT /label= Signal_peptide 

FT Region 30 . . 278 

ft /note= n LAP -bet a 1" 

FT Domain 244 . . 246 

FT /note= " RGD motif" 

FT Protein 279.. 390 

FT /note= "Human mature TGF-betal protein" 



XX 

PN WO200190186-A1. 
XX 

PD 29-NOV-2001. 
XX 

PF 25-MAY-2001; 2001WO-GB02352 . 
XX 

PR 26-MAY-2000; 2000GB- 0012991 . 

PR 05-JAN-2001; 2001GB- 0000286 . 
XX 

PA (GLAX ) GLAXO GROUP LTD. 
XX 

PI Ludbrook S, Barry S, Horgan C, Miller D; 
XX 

DR WPI; 2002-097645/13. 
XX 

PT Identifying modulators of interactions between latency associated 

PT peptides and integrin alphavbeta3 for therapeutics, by contacting the 

PT peptide and integrin with a test product and determining if the product 

PT modulates interaction - 

XX 

PS Disclosure; Page 37-39; 44pp ; English. 
XX 

CC The invention relates to a method for identification of a modulator 

CC of the interaction between latency associated peptide (LAP) of 

CC transforming growth factor-betal (TGF-betal) and integrin alphavbeta3 . 

CC The method is useful for identifying a modulator of the interaction 

CC between LAP and integrin alphavbeta3. The method is useful for 

CC immunomodulation, in the treatment of inflammatory disease, fibrotic 

CC disease, cancer, diabetic retinopathy, bone resorption or osteoporosis, 

CC and for preventing apoptosis administering the modulator to the host. 

CC The modulator (inhibitor of the interaction between LAP-betal and 

CC integrin alphavbeta3) is useful in the manufacture of a medicament for 

CC immunomodulation. The modulator (activator of the interaction between 

CC LAP-betal and integrin alphavbeta3 ) is useful in the manufacture of 

CC medicament for preventing apoptosis. The modulator is useful for 

CC treating a inflammatory or fibrotic disease such as chronic obstructive 

CC pulmonary disorder, rheumatoid arthritis, psoriasis, restenosis, 

CC atherosclerosis, liver fibrosis and asthma. The present sequence is 

CC human TGF-betal protein. 

XX 

SQ Sequence 3 90 AA; 



Query Match 90.9%; Score 1921; DB 23; Length 390; 

Best Local Similarity 91.7%; Pred. No. 7.6e-162; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MA PSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKT I DMELVKRKR I EA I RGQ I LS KLRLA 60 

I Mill 1 1 M I '1 1 1 1 1 ! 1 1 1 MM I M 1 1 1 1 II I II 1 1 M I hi II 1 1 1 1 1! 1 1 

Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EA I RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I Ih I I I I I ( I II I i I II I I I I I II I I i I I I II i I I I II I I I II I I I I I I : hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : Ihl I II 1 1 Ml II II M I M M 1 1 III II Ml 1 1 Ml II 1 1 !( M I li 



Db 



121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 



Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

MM MIIMI Ml I II Ml I II Ml M I lllllllll I Ihllll Mill 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

: I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I II I I II III 
Db 241 TGRRGDLATI HGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

1 1 ; 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 ii ii 1 1 ii 1 1 1 1 1 1 1 1 1 1 ii i ii : 1 1 1 1 ii 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPI VYYVGRKPKVEQLiSNMI VRSCKCS 398 

1 1 : 1 1 1 1: 1 1 : 1 I I I I Ml I II I I I I I I I I' I I II 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 11 
ABB82780 

ID ABB82780 standard; Protein; 391 AA. 
XX 

AC ABB82780; 
XX 

DT 18-MAR-2003 (first entry) 
XX 

DE TGFB1 Arg25Pro polymorphism G-allele protein sequence. 
XX 

KW Polymorphism; transforming growth factor beta 1; TGFB1 ; TGFbeta-1; 

KW renal failure; nephrotropic ; human; allele. 

XX 

OS Homo sapiens. 
XX 

PN WO200290585-A2 . 
XX 

PD 14-NOV-2002. 
XX 

PF 08-MAY-2002; 2002WO-GB02 066 . 
XX 

PR 09-MAY-2001; 2001GB-0011277 . 
XX 

PA (UYSH-) UNIV SHEFFIELD HALLAM. 
XX 

PI El-Nahas AM, Blakemore A, Khalil MS; 
XX 

DR WPI; 2003-120560/11. 

DR N-PSDB; ABV75391. 
XX 

PT Determining an individual's susceptibility to the progression of renal 

PT failure comprises detecting the presence of a genetic polymorphism 

PT pattern in transforming growth factor beta 1 (TGFB1) gene in a sample 

PT from the individual 
XX 

PS Claim 51; Page 59-61; 62pp ; English. 
XX 

CC The invention relates to determining an individual's susceptibility to 



CC renal failure and invovles detecting the presence of a genetic 

CC polymorphism pattern in transforming growth factor beta 1 (TGFB1) gene in 

CC a sample from an individual, where polymorphism pattern is associated 

CC with renal failure. The method is useful for determining an individual's 

CC susceptibility to the progression of renal failure. The nucleic acid 

CC comprising a T(-509)C polymorphism of TGFB1 gene, or a polypeptide 

CC comprising a sequence of 391 amino acids is useful for preparing a 

CC medicament for retarding or preventing the progression of renal disease, 

CC and for drug research purposes for retarding or preventing the 

CC progression of renal disease. Sequences ABV75386-88 represents the 

CC protein sequence for the TGFB1 G-allele of the Arg25Pro polymorphism 

CC of exon 1. 

XX 

SQ Sequence 3 91 AA; 

Query Match 90.8%; Score 1920.5; DB 24; Length 391; 

Best Local Similarity 91.7%; Pred. No. 8.5e-162; 

Matches 366; Conservative 10; Mismatches 14; Indels 9; Gaps 2; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EA I RGQ I LSKLRLA 6 0 

I III III IMMM II I IMIMI I I I II I MM MUM MM I II I M MUM 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

IIIIMIIIIIMIIIIIMI i hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

Mill : Ihll I I I I I I I I II II I I I M I ! I I I lllllllllllllllllhll 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 18 0 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 239 

llllllllllllllllllllllllllllhl Illllllll I Ihllll hllll 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 240 

Qy 24 0 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEK 2 99 

MMMIII MMIMMMIIIMMII I I : I I II 
Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EK 2 92 

Qy 300 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 359 

1 1 1 1 MM 1 1 M 1 1 II II I II I M M II II M I ! I II I II II Mill I II II I III Ml 

Db 293 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 352 

Qy 360 AAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 398 

1 1 1 1 1 1 1 1 1 1 ! 1 1 ! 1 1 1 1 1 1 1 1 j 1 . 1 1 1 1 1 1 1 1 1 1 1 

Db 353 AAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 91 



RESULT 12 
AAR20124 

ID AAR20124 standard; Protein; 390 AA. 
XX 

AC AAR20124; 
XX 

DT 25-MAR-2003 (updated) 

DT 16-APR-1992 (first entry) 

XX 



DE Sequence of simian transforming growth factor (TGF) beta-1. 
XX 

KW Hypertension therapy; hypotensive agent; blood pressure modulator. 
XX 

OS Monkey . 
XX 

FH Key 
FT Peptide 
FT Protein 
XX 

PN W09119513-A. 
XX 

PD 26-DEC-1991. 
XX 

PF 20-JUN-1991; 
XX 

PR 20-JUN-1990; 
XX 

PA (BRIM ) BRISTOL-MYERS SQUIBB CO. 
XX 

PI Oleson FB, Comereski CR; 
XX 

DR WPI; 1992-024199/03. 
DR N-PSDB; AAQ20289. 
XX 

PT Use of transforming growth factor (TGF) -beta and their 
PT antagonists - for modulating blood pressure, for treating 
PT hypertension and hypotension 
XX 

PS Disclosure; Fig 1; 42pp; English. 
XX 

CC A new method for treating hypertension comprises administering a 
CC transforming growth factor (TGF) -beta to an individual at a dose 
CC effective for lowering blood pressure; the TGF-beta may be e.g. 
CC mature TGF-beta, TGF-beta2, a mature TGF-betal/beta2 hybrid, TGF- 
CC betal precursor, a latent TGF-beta2 precursor, hybrid TGF -beta 1 /TGF - 
CC beta2 precursor, a latent TGF-betal complex or a latent TGF-beta2 
CC complex. 

CC (Updated on 25-MAR-2003 to correct PA field.) 
XX 

SQ Sequence 3 90 AA; 



Location/Qualifiers 
8 . .21 
279. .390 



91WO-US04449. 
90US-0541221. 



Query Match 90.8%; Score 1919; DB 13 

Best Local Similarity 91.5%; Pred. No. l.le-161 
Matches 364; Conservative 10; Mismatches 16 



Length 390; 

Indels 8; Gaps 1; 



Qy 

Db 

Qy 
Db 

Qy 

Db 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 

I lllllllllllllllllllll lllllllllllllllllllllll llllllllllll 
1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRKRIETIRGQILSKLRLA 



61 



60 



60 



120 



SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

II 1 1 h M I > 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 lllllllllllllllllllllh hi 

61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 



121 



180 



YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

Mill = 11 = 11 ! I III I. I I I M I I I I I I I i I II : II I II I I I I ; I I I I I I I h I II 

121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 



Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

I M i I ( 1 1 1 : i I M 1 1 1 1 II 1 1 1 1 1 M 11111:11 I IMii hill 

Db 181 YLSNRLLAPSNSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSKDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

: I I I I I I I Ml I I M I I I I I I I I I I I I I I I I lllllllllllllllll Ml 
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

1 1 1 1 1 1 1 i I IN 1 1 ! I li 1 : 1 1 1 1 1 1 1 M 1 1 II 1 1 1 M 1 1 1 1 1 1 Ml 1 1 1 1 1 1 II 1 1 1 1 

Db 2 93 CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

M 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 M 1 1 1 M 1 1! I M II 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 13 




AAR46227 




ID 


AAR46227 standard; Protein; 390 AA. 


XX 






AC 


AAR46227; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


09-JUL-1994 


(first entry) 


XX 






DE 


Human pre-TGF 


-beta-1 . 


XX 






KW 


TGF-beta-1; TGF-beta-2; transforming growth factor bet, 


KW 


transforming growth factor beta-3; recombinant; wound : 


KW 


vulnerary . 




XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qualifiers 


FT 


Peptide 


279. .390 


FT 




/ label = Mat_peptide 


FT 


Cleavage-site 


279 


FT 




/note= "TGF-beta-1 release site" 


FT 


Modif ied-site 


82. .84 


FT 




/label= N-glycosylation site 


FT 


Modif ied-site 


136.. 138 


FT 




/label= N-glycosylation site 


FT 


Modif ied-site 


176 . . 178 


FT 




/label= N-glycosylation_site 


XX 






PN 


US5284763-A. 




XX 






PD 


08-FEB-1994 . 




XX 






PF 


04-MAR-1992; 


92US-0845893 . 


XX 






PR 


22-MAR-1985; 


85US-0715142. 


PR 


13-MAR-1987; 


87US-0025423 . 


PR 


04-AUG-1989; 


89US-0389929. 


PR 


04-MAR-1992; 


92US-0845893 . 



XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynk RMA, Goeddel DV; 
XX 

DR WPI; 1994-056343/07. 

DR N-PSDB; AAQ56923 . 
XX 

PT Nucleic acid sequences encoding transforming growth factor-beta - 

PT diagnostic probes, and for use in therapeutics 

XX 

PS Disclosure; Fig lb; 25pp; English. 
XX 

CC cDNA sequences were determined for human pre-TGF-beta-1 (AAQ56923), 

CC pig TGF-beta-3 (AAQ56925) and human TGF-beta-3 (AAQ56926) , and the 

CC corresponding amino acid sequences were determined (AAR46227-29 , 

CC respectively) . A genomic fragment corresponding to a human TGF- 

CC beta-1 exon (AAQ56924) was also isolated and its amino acid sequence 

CC determined (AAR4623 0) . The sequences have been used in the 

CC construction of vectors for the expression of recombinant TGF- 

CC beta. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 90.6%; Score 1915; DB 15; Length 390; 

Best Local Similarity 91.5%; Pred . No. 2.6e-161; 

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I lllilllll IIIIIMIIM II 1 1 1 II 1 1 1 II II I ' 1 1 1 1 1 1 II 1 1 II 1 1 1 ! Ml 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I I I I h I I I I I I I I I I I I I I I I I III : I I I I I IN | , | M | | | Ml | | | | | |: 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEREPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill = I M II I I I I I I I I I I I I I I I I I I I MM I I I I I I I I II I I I I I II MM 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

M M I II 1 1 1 1 I II 1 1 1 1 1 1 1 1 Ml I M I lllilllll I IMMM MINI 

Db 181 YLSNRLLAPSDS PEWLSFDVTGVVRQWLSRGGE I EGFRLSAHCSCDSRDNTLQVDI NGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

M I M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 M III M I: lllllllllllllllll III 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLY I DFRKDLGWKW I HE PKG YHANFCLGPCP YI WSLDTQYSKVLALYNQHNPGASA 360 

I I Ml I II M I M IM 1 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 M II M 1 1 1 M I II I IMM 1 1 1 Ml 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MMIMMIMMMMMIMMIMM I II 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 14 
AAR83054 

ID AAR83054 standard; Protein; 391 AA. 
XX 

AC AAR83054; 
XX 

DT 25-JUN-1996 (first entry) 
XX 

DE Transforming growth factor-beta 1. 
XX 

KW macrophage inducible nitric oxide synthase; iNOS; constitutive NOS; 

KW interleukin-l-beta; transforming growth factor-beta; TGF-beta; ILl-beta; 

KW nitric oxide production; hypotension; inflammation; septic shock; 

KW treatment. 

XX 

OS Mammalian sp. 
XX 

FH Key Location/Qualifiers 
FT Protein 279 . .391 

FT /note= "represents the mature active TGF beta-1 mol . " 

XX 

PN W09526745-A1. 
XX 

PD 12-OCT-1995. 
XX 

PF 05-APR-1994; 94WO-US03705 . 
XX 

PR 05-APR-1994; 94WO-US03 705 . 
XX 

PA (HARD ) HARVARD COLLEGE . 
XX 

PI Lee M, Perrella MA; 
XX 

DR WPI; 1995-358443/46. 
DR N-PSDB; AAT05876. 
XX 

PT Treatment of hypotension, esp . in septic shock - by administering 
PT transforming growth factor-beta e.g. to inhibit inducible nitric 
PT oxide synthase gene transcription 
XX 

PS Disclosure; Fig 17; 52pp; English. 
XX 

CC Transforming growth factor-beta 1 (TGF-beta 1) has been found to inhibit 

CC inducible nitric oxide synthase (iNOS) gene transcription, esp. in 

CC mterleukin-l-beta (ILl-beta) stimulated rat smooth muscle cells, and at 

CC a dose which does not inhibit consitutive NOS. TGF-beta 1 or 2 

CC (AAR83055) or their active fragments (esp. derived from the 

CC carboxy-terminal 112 amino acids) , can be used in the treatment of 

CC hypotension, such as that associated with severe inflammation or septic 

CC shock. 

XX 

SQ Sequence 391 AA; 



Query Match 90.5%; Score 1913.5; DB 16; Length 391; 

Best Local Similarity 91.5%; Pred. No. 3.5e-161; 

Matches 365; Conservative 10; Mismatches 15; Indels 9; Gaps 2; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I 1 1 1 1 1 1 M i M II 1 1 1 1 1 1 1 ! I IIMIIIIIIIIIIIIIIIIMIIIIIIIIIMII 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQ1 120 

1 1 1 1 1 h 1 1 Ml II 1 1 II 1 1 1 1 1 II 1 1 1 M lllllllllllllllllllllh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

Mill : Ihll IIIIIIIIIIMIIIIIIIIIIII MMIIIIMMIIIIIhll 
Db 121 YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 18 0 

Qy 18 0 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 23 9 

, 1 1 II 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 h MINIMI I Ihllll 1 = 1111 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 24 0 

Qy 24 0 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEK 2 99 

M 1 1 1 1 1 1 Mill 1 1 II II I M I II M I M I I Ml I II II II I II 1 1 1 II 

Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EK 2 92 

Qy 3 00 NCCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGAS 35 9 

1 1 1 1 ! 1 1 1 1 ! I I 1 1 1 ! 1 : 1 1 1 1 1 1 MINI : 1 1 < ' I 1 1 1 1 , 1 , 1 1 1 1 : 1 1 ' . 1 1 1 1 

Db 293 NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 352 

Qy 360 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

M I M I M 1 1 II II 1 1 1 MM 1 1 1 1 M M 1 1 1 MM 1 1 1 

Db 353 AAPCCVPQALEPLPIWYVGRKPKVEQLSNMIWSCKCS 391 



RESULT 15 




AAW78785 




ID 


AAW78785 standard; Protein; 390 AA. 


XX 






AC 


AAW78785; 




XX 






DT 


25-MAR-2003 


(updated) 


DT 


21-DEC-1998 


(first entry) 


XX 






DE 


Human pre- transforming growth factor-beta 1. 


XX 






KW 


Transforming growth factor-beta 1; TGF-beta 1; human 


XX 






OS 


Homo sapiens. 




XX 






FH 


Key 


Location/Qual if iers 


FT 


Domain 


8 . .23 


FT 




/note= "hydrophobic domain" 


FT 


Protein 


279. .390 


FT 




/label= Mat protein 


FT 


Modif ied-site 


82.. 84 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modif ied-site 


136.. 138 


FT 




/note= "Asn is N-glycosylated" 


FT 


Modi f ied-site 


176. .178 


FT 




/note= "Asn is N-glycosylated" 


FT 


Cleavage-site 


277.. 278 



FT /note= "cleavage site for relase of TGF-beta 1" 

XX 

PN US5801231-A. 
XX 

PD 01-SEP-1998. 
XX 

PF 30-MAY-1995; 95US- 0454468 . 
XX 

PR 13-MAR-1987; 87US-0025423 . 

PR 22-MAR-1985; 85US- 0715142 . 

PR 04-AUG-1989; 89US- 038 9 92 9 . 

PR 04-MAR-1992; 92US- 0845893 . 

PR 05-NOV-1993; 93US- 0147364 . 

PR 30-MAY-1995; 95US-0454468 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Derynck RMA, Goeddel DV; 
XX 

DR WPI; 1998-494840/42. 

DR N-PSDB; AAV52 933 . 
XX 

PT DNA encoding transforming growth factor-beta precursor sequence - 

PT useful for analysis to perform manipulations to increase yield of 

PT recombinant production of the protein 
XX 

PS Example 3; Fig IB 1-3; 26pp; English. 
XX 

CC This is the amino acid sequence of human transforming growth 

CC factor-beta 1 precursor (preTGF-beta 1) . It was deduced from 

CC a preTGF-beta 1 cDNA sequence (see AAV52933) . The invention relates 

CC to the recombinant production of TGF-beta. Biologically active 

CC TGF-beta is defined as being capable of inducing EGF-potentiated 

CC anchorage independent growth of target cell lines and/or growth 

CC inhibition of neoplastic cell lines. Nucleic acids encoding 

CC TGF-beta have been isolated and cloned into vectors which are 

CC replicated in bacteria and expressed in eukaryotic cells. TGF-beta 

CC recovered from transformed cells is used in known therapeutic 

CC applications. 

CC (Updated on 25-MAR-2003 to correct PF field.) 
XX 

SQ Sequence 390 AA; 

Query Match 90.5%; Score 1913; DB 19; Length 390; 

Best Local Similarity 91.2%; Pred. No. 3.9e-161; 

Matches 363; Conservative 10; Mismatches 17; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

i iiiiiMiiiiiiiiiiiilii n iiiiiiiiiii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAPGLSTCKTI DMEQVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MlllhllMMIIIIIIMIIIIIIMIII MIMIIIIIIMIIIIIIIIh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll 1 1 I > 1 1 1 1 1 1 1 M II I II 1 1 M I II 1 1 1 1 1 1 1 Mill! 1 1 h 1 1 1 



Db 



121 YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 



Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

I II II I II 1 1 1 1 M II II II 1 1 1 II II I M IN MM II I MM I II IMMI 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

: I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I II I I I 1 I I I III 
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 2 92 

Qy 3 01 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 60 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIMIIIIIIMIIIIIIII 

Db 2 93 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 52 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

I II M M M I M M M 1 1 II M M I M M 1 1 M II M I 

Db 3 53 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



Search completed: October 28, 2003, 09:06:47 
Job time : 34.2026 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2 003 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 
Perfect score: 2114 



October 28, 2003, 09:09:54 ; Search time 24.1473 Seconds 

(without alignments) 
2760.110 Million cell updates/sec 

US-10-017-372E-13 



Sequence : 



1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNMI VRSCKCS 398 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



629382 seqs, 167460630 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



629382 



Database 



Published__Applicat ions_AA : * 

1 .- /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB.pep: 

2 : /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB . pep : * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep 

4 : /cgn2__6/ptodata/2/pubpaa/US06_PUBCOMB.pep 

5 : / cgn2__6 /p t oda t a / 2 /pubpaa / US 0 7_NEW_PUB . pep 

6 : /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep 



7 : 


/cgn2_ 


8 : 


/cgn2_ 


9 : 


/cgn2_ 


10 


/cgn2 


11 


/cgn2^ 


12 


/cgn2 


13 


/cgn2^ 


14 


/cgn2 


15 


/cgn2 


16 


/cgn2 


17 


/cgn2 


18 


/cgn2 



6 /pt oda t a / 2 /pubpaa/US 0 8_NEW_PUB . pep : * 
6 /pt oda t a / 2 /pubpa a /US 0 8_PUBCOMB . pep : * 
6 /ptoda t a/ 2 /pubpaa/US 0 9A_PUBCOMB . pep : * 
6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: 
6/ptodata/2/pubpaa/US09C__PUBCOMB . pep : 
6/ptodata/2/pubpaa/US09__NEW_PUB.pep: * 
6/ptodata/2/pubpaa/US10A_PUBCOMB.pep; 
6 /Dtodata /2 /DubDaa /US10B PUBCOMB . Deo : 



j6/ptodata/2/pubpaa/US10C_PUBCOMB.pep 
_6/ptodata/2/pubpaa/US10_NEW_PUB .pep: 
_6/ptodata/2/pubpaa/US60_NEW_PUB .pep: 
_6/ptodata/2/pubpaa/US60_PUBCOMB . pep : 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 



Score 



Query 

Match Length DB ID 



Description 



1 


2035 


96 


3 


390 


11 


US- 


09 


-214 


-592-26 


Sequence 


26, 


Appl 


2 


1934 


91 


5 


390 


11 


us- 


09 


-214 


-592-29 


Sequence 


29, 


Appl 


3 


1931 


91 


3 


390 


15 


us- 


10 


-087 


-268-2 


Sequence 


2, 


Appli 


4 


1924 


91 


0 


390 


15 


us- 


10 


-087 


-268-5 


Sequence 


5, 


Appli 


5 


1921 


90 


9 


390 


12 


us- 


10 


-276 


-947-1 


Sequence 


1 / 


Appli 


6 


1919 


90 


8 


390 


11 


us- 


09 


-214 


-592-33 


Sequence 


33, 


Appl 


7 


1913 .5 


90 


5 


391 


11 


us- 


09 


-214 


-592-17 


Sequence 


17, 


Appl 


8 


1910 


90 


4 


390 


10 


us- 


09 


-756 


-283A-23 


Sequence 


23, 


Appl 


9 


1898 


89 


8 


390 


11 


us- 


09 


-214 


-592-28 


Sequence 


28, 


Appl 


10 


1840 


87 


0 


390 


11 


us- 


09 


-214 


-592-20 


Sequence 


20, 


Appl 


11 


1840 


87 


0 


390 


11 


us- 


09 


-214 


-592-23 


Sequence 


23, 


Appl 


12 


1577 


74 


6 


315 


11 


us- 


09 


-214 


-592-25 


Sequence 


25, 


Appl 


13 


1262 . 5 


59 


7 


455 


10 


us- 


09 


-756 


-283A-20 


Sequence 


20, 


Appl 


14 


1139 


53 


9 


447 


10 


us- 


09 


-756 


-283A-22 


Sequence 


22, 


Appl 


15 


1063 .5 


50 


3 


373 


11 


us- 


09 


-214 


-592-32 


Sequence 


32, 


Appl 


16 


876 


41 


4 


412 


11 


us- 


09 


-214 


-592-31 


Sequence 


31, 


Appl 


17 


874 


41 


3 


382 


11 


us- 


09 


-214 


-592-34 


Sequence 


34, 


Appl 


18 


873 . 5 


41 


3 


383 


10 


us- 


09 


-756 


-283A-27 


Sequence 


27, 


Appl 


19 


838 . 5 


39 


7 


409 


11 


us- 


09 


-214 


-592-27 


Sequence 


27, 


Appl 


20 


832 


39 


4 


410 


11 


us- 


09 


-214 


-592-22 


Sequence 


22, 


Appl 


21 


828 


39 


2 


412 


11 


us- 


09 


-214 


-592-24 


Sequence 


24, 


Appl 


22 


826 


39 


1 


412 


11 


us- 


09 


-214 


-592-19 


Sequence 


19, 


Appl 


23 


826 


39 


1 


412 


14 


us- 


10 


-028 


-158-21 


Sequence 


21, 


Appl 


24 


822 


38 


9 


412 


10 


us- 


09 


-756 


-283A-25 


Sequence 


25, 


Appl 


25 


821 . 5 


38 


9 


414 


11 


us- 


09 


-214 


-592-21 


Sequence 


21, 


Appl 


26 


808 . 5 


38 


2 


304 


10 


us- 


09 


-756 


-283A-26 


Sequence 


26.. 


Appl 


27 


bU6 . 5 


38 


2 


414 


10 


us- 


09 


-756 


-283A-24 


Sequence 


24, 


Appl 


28 


806.5 


38 


2 


414 


11 


us- 


09 


-214 


-592-18 


Sequence 


18, 


Appl 


29 


805.5 


38 


1 


412 


11 


us- 


09 


-214 


-592-30 


Sequence 


30, 


Appl 


30 


758 


35 


9 


139 


14 


us- 


10 


-002 


-278-8 


Sequence 


8, 


Appli 


31 


642 


30 


4 


115 


10 


us- 


09 


-859 


-211-47 


Sequence 


47, 


Appl 


32 


642 


30 


4 


115 


10 


us- 


09 


-880 


-708-25 


Sequence 


25, 


Appl 


33 


642 


30 


4 


115 


11 


us- 


09 


-872 


-856-47 


Sequence 


47, 


Appl 


34 


642 


30 


4 


115 


15 


us- 


10 


-335 


-483-29 


Sequence 


29, 


Appl 


35 


634 


30 


0 


114 


10 


us- 


09 


-813 


-459-22 


Sequence 


22, 


Appl 



36 


634 


30 


0 


114 


14 


US- 


10 


-115 


-406-21 


Sequence 


21, Appl 


37 


634 


30 


0 


114 


15 


US- 


10 


-154 


-333-23 


Sequence 


23, Appl 


38 


624 


29 


5 


112 


10 


us- 


09 


-813 


-271B-2 


Sequence 


2, Appli 


39 


624 


29 


5 


113 


10 


us- 


09 


-813 


-398-13 


Sequence 


13, Appl 


40 


562 


26 


6 


98 


12 


us- 


10 


-187 


-394-1 


Sequence 


1, Appli 


41 


548 


25 


9 


116 


14 


us- 


10 


-115 


-406-24 


Sequence 


24, Appl 


42 


548 


25 


9 


116 


15 


us- 


10 


-154 


-333-26 


Sequence 


26, Appl 


43 


521 


24 


6 


112 


10 


us- 


09 


-813 


-271B-8 


Sequence 


8, Appli 


44 


499 


23 


6 


98 


12 


us- 


10 


-187 


-394-4 


Sequence 


4, Appli 


45 


497 


23 


5 


114 


14 


us- 


10 


-115 


-406-25 


Sequence 


25, Appl 



ALIGNMENTS 



RESULT 1 

US-09-214-592-26 

Sequence 26, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION : 
APPLICANT: Yamasaki,CMotoo 
APPLICANT: Shibata , CKenj i 
APPLICANT: Sato , CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 26 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: porcine 
US-09-214-592-26 



Query Match 96.3%; Score 2035; DB 11 

Best Local Similarity 97.0%; Pred. No. 7.2e-180 
Matches 386; Conservative 1; Mismatches 3 



Length 3 90; 

Indels 8; Gaps 1; 



Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I II I II I i 1 1 M 1 1 11 1 1 ! 1 1 1 II 1 1 1 1 ! 1 1 1 ! 1 1 1 1 1 ! 1 1 1 1 1 ( 1 1 M 1 1 1 1 1 1 1 M ! 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

! I MM I IIIM 1 1 II 1 1 , 1 1 1 M II 1 1 1 1 M II 1 1 M 1 1 1 Ml II -' ! Ml I h III ! M 

Db 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMLESGNQT 170 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

MM M MM ! i I ! M M i M M MMM M ! !M I M. MM 1 1 M 1 1 Mi M I M,M M MM I 

Db 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

I MIMMMI IMMI 1 1 MM 1 1 M 1 1 1 1 M III Ml MM I 1 1 1 1 1 1 1 1 1 1 1 II II 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 24 0 



Qy 


241 


Db 


241 


Qy 


301 


Db 


293 


Qy 


361 


Db 


353 



-EKN 292 



IIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIMIIMIIIIIIIIII 

!CVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVIALYNQHNPGASi 

iPCCVPQALEPLPI VYYVGRKPKVEQLSNMIVRSCKCS 398 
lllllllllllllllllllllllllllllllllllll 



RESULT 2 

US-09-214-592-29 

Sequence 29, Application US/09214592A 
Publication No. US20030027218A1 
GENERAL INFORMATION : 
APPLICANT: Yamasaki , CMOtoo 
APPLICANT: Shibata , CKenj l 
APPLICANT: Sato , CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 29 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: ovine 
US-09-214-592-29 

Query Match 91.5%; Score 1934; DB 11; Length 390; 

Best Local Similarity 91.7%; Pred. No. 1.6e-170; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I llllllllllllllllhllllll IIIMIIIIIIIIIMI lllllllllllllll 
Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRKGI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

M 1 1 M M M I ! 1 1 ! h 1 1 1 1 1 1 1 1 1 1 M I II I 1 1 1 II 1 1 1 M M 1 1 1 1 1 II I (hi 

Db 61 SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I : Ihll MMMMMMIIMMhMMIMMIMMMMIMhlM 
Db 121 YDKMKSSSHS I YMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

IIIIIMIMIIIIIIIIIIIIIIIIMI II MINIMI I IMIMI hlllh 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

1 1 1 M II II 1 1 II II II II I II II 1 1 M 1 1 II 1 1 1 1 II 1 1 1 1 II M 1 1 1 III 



Db 



241 SGRRGDI^TIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 2 92 



Qy 


301 


CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 60 

IIIIIIIIIIIIMIMIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIMIIIII 


Db 


293 


CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 


Qy 


361 


APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

llllllllllllllllllllllllllllllllllllll 
APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 


Db 


353 



RESULT 3 
US-10-087-268-2 

Sequence 2, Application US/10087268 
Publication Wo. US200301 19010A1 
GENERAL INFORMATION: 
APPLICANT: Jonsonn, Julie Ruth 
APPLICANT : Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

FILE REFERENCE: Fibrosis 

CURRENT APPLICATION NUMBER: US/10/08 7,268 
CURRENT FILING DATE: 2002-03-01 
NUMBER OF SEQ ID NOS : 6 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 2 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: Human 
US-10-087-268-2 

Query Match 91.3%; Score 1931; DB 15; Length 390; 

Best Local Similarity 92.0%; Pred. No. 3.1e-170; 

Matches 366; Conservative 10; Mismatches 14; Indels 8; Gaps 1; 

.PSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQI LSKLRLA 6 0 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII1IIIMIII 



1 1 1 iih 1 1 1 1 1 1 ;i 1 1 'i i i:i i u 1 1 1 1 1 1 1 1 >ii i, 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 
YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

Mill = ihii ii i hi 1 1 1 1 ; ii h 1 1 : 1 1 1 1 mi 1 1 1 1 1 ii ii I'M i ih ;ii 

YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 
YLSNRLLAPSDS PEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVE I NG FN 

MiMMMIIIMMIIIIIIIIIIIhl lllllllll I Ihllll hllll 



I ! I II i 1 1 1 1 1 1 II i II 1 1 1 1 1 1 1 1 1, 1 1 1 Ml I ill, 1 1 1 1 1 1 1 ! I III 

'GRRGDLAT I HGMNRPFLLLMATPLERAQHLQS SRHRRALDTNYCFS ST EKN 2 92 



1 1 1 ;ii 1 1 1 1 ; 1 1 1 1 1 i.ii 1 1 1 1 1 'i ii 1 1 1 : ii : iii ii 1 1 ;i 1 1 1 1 ii 1 1 1 ; 1 1 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 



Db 2 93 CC VRQLY I DFRKDLGWKW I HEP KG YHANFCLG PCP Y I WSLDTQYS KVLALYNQHN PGASA 352 



Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

! 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M ! 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 4 
US-10-087-268-5 

; Sequence 5, Application US/10087268 
; Publication No. US20030119010A1 
; GENERAL INFORMATION: 

APPLICANT: Jonsonn, Julie Ruth 
; APPLICANT: Powell, Elizabeth Ellen 

TITLE OF INVENTION: Polypeptides and polynucleotides linked to a disease or 
condition 

; FILE REFERENCE: Fibrosis 

; CURRENT APPLICATION NUMBER : US/ 1 0/ 087 , 268 
; CURRENT FILING DATE: 2 002-03-01 
; NUMBER OF SEQ ID NOS : 6 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 5 

LENGTH: 3 90 

TYPE : PRT 

ORGANISM: Human 
US-10-087-268-5 

Query Match 91.0%; Score 1924; DB 15; Length 390; 

Best Local Similarity 91.7%; Pred. No. 1.4e-169; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I llllllllllllllllllllll I . I; I III 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 MM I II 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I . MM llllllllllllllllllllll: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll lllllllllllllllllllllllllllllllllllllllllhlll 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 240 

Illllllllllllllllllllllllllhi lllllllll I Ihllll hllll 
Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

^ I I I I I . I II I II II I I M I M : | 1 1 1 , 1 1 1 1 1 1 1 1 , 1 1 1 1 I 1 1 1 1 1 ill 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CC WQL Y I DFR KDLGWKW I HE PKGYHANFCLGP CP Y I WSLDTQYS KVLALYNQHN PGASA 360 

M M MMiMMM MMlMMI MM IM MMMMMMMI M Mi MMIM Mi II 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

II 1 1 1 MM II M II 1 1 1 M II M I M I M III I M M 



Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 5 
US-10-276-947-1 

Sequence 1, Application US/10276947 
Publication No. US20030176315A1 
GENERAL INFORMATION: 
APPLICANT: GLAXO GROUP LIMITED 
TITLE OF INVENTION : MEDICAL PRODUCTS 
FILE REFERENCE: PG3 94 9 

CURRENT APPLICATION NUMBER: US/ 10/276 , 947 
CURRENT FILING DATE: 2002-11-21 
PRIOR APPLICATION NUMBER : GB 0012991.6 
PRIOR FILING DATE: 2000-05-26 
PRIOR APPLICATION NUMBER: GB 0100286.4 
PRIOR FILING DATE: 2001-01-05 
NUMBER OF SEQ ID NOS : 1 
SOFTWARE: Pa tent In Ver. 2.0 
SEQ ID NO 1 
LENGTH: 3 90 
TYPE : PRT 

ORGANISM: Homo sapiens 
US-10-276-947-1 

Query Match 90.9%; Score 1921; DB 12; Length 390; 

Best Local Similarity 91.7%; Pred. No. 2.6e-169; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I 1 1 1 1 1 ' I IIIIIIMIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIMMMI 

Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

iiiiihiiiiiiiiiMiiMiiiiiiiiii 1 1 1 ; i i i : m ii i ii 1 1 1 1 1 ih hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I II 1 1 : Ihll IIIIIIIIIIIIIIIIIMMIMIMMIIIMMIMIIhlll 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

IIIIIMIIIIIIIMIIIIMIIIMMI MINIMI I IMIMI MUM 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

MIMIIMMIMMIMIIIIIIIMIM I M 1 1 ! 1 1 1 1 1 1 1 1 f 1 1 Ml 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 2 92 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

MMMIMMMIMIMIMMMMMIMIMMMMMMMMIMMIMM 

Db 2 93 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

II II II II I II M II II I II II I I II II I I I I I I II I I 

Db 3 53 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 6 

US-09-214-592-33 

Sequence 33, Application US/09214592A 
Publication No. US2003 0027218A1 
GENERAL INFORMATION: 
APPLICANT: Yamasaki , CMotoo 
APPLICANT: Shibata,CKenji 
APPLICANT: Sato , CYasuf umi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY-REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE : 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 33 
LENGTH: 3 90 
TYPE: PRT 
ORGANISM: simian 
US-09-214-592-33 

Query Match 9 0.8%; Score 1919; DB 11; Length 3 90; 

Best Local Similarity 91.5%; Pred. No. 4e-169; 

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I 1 I! I I I I I M| I,! | | | | | I , | I I I I I I I I ; I I [| I I, I I i III I IMIIIIIIIII 

MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTI DMELVKRKRI ETI RGQI LSKLRLA 6 0 
SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

MlllhlllllMIIIIIIMIMIIMIII IIIMIIIIMIMMIIIIIh hi 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


293 


Qy 


361 


Db 


353 



in :i i ii i h m;i ii i ii ;i i 'in h i minimi i mimm mum 

YLSNRLLAPSN S P EWLS FDVTG WRQWLS RGGE I EGFRLSAHCSCDS KDNTLQVD I NGF' 
SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEK] 

mi 1 1 1 ii 1 1 1 ;'i m 1 1 1 1 1 1 m 1 1 ii i; 1 1 mi i ii 1 1 1 1 ii 1 1 1 ii 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKJ 

CCVRQLY I DFR KDLGWKW I HE PKGYHANFCLGPCPY I WSLDTQYSKVLALYNQHNPGASi 

i i i 1 1 M 1 1 II 1 1 1 1 1 1 1 1 !! 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II II 1 1 1 1 1 II 1 1 

CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASJ 
APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

MMMMMIMMMMMMMIMIMMIIMI 



RESULT 7 



US-09-214-592-17 

Sequence 17, Application US/09214592A 
Publication No. US2003 0027218A1 
GENERAL INFORMATION : 
APPLICANT: Yamasaki , CMotoo 
APPLICANT: Shibata , CKen j i 
APPLICANT: Sato,CYasufumi 

TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACT I VI TY- REGULATING COMPOUNDS 
FILE REFERENCE: 11060 

CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 17 
LENGTH: 3 91 
TYPE : PRT 
ORGANISM: human 
US-09-214-592-17 

Query Match 90.5%; Score 1913.5; DB 11; Length 391; 

Best Local Similarity 91.5%; Pred. No. 1.3e-168; 

Matches 365; Conservative 10; Mismatches 15; Indels 9; Gaps 2; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

i MiiiiiMiiiiiiiiiiiii M 1 1 ii :i 1 1 1 1, 1 '! ii 1 1 1 mim i: 1 1 1 1 1 mi 

Db 1 MP PSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLiALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1 1 , i hi 1 1 1 m 1 1 1 1 1 1 1 1 1 1! i iii i ii 1 1 ii i ;i n 1 1 1 ii ii i h 1 1 h hi 

Db 61 S P PSQGEVP PGPLPEAVLALYNSTRDR VAGESAE PE P E PEADYYAKEVTR VLMVETHNE I 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSW 179 

I II II : I hi I Mil MM I MIM III III MM II M II I II 1 1 II I II II M I 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 180 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGF 239 

II MUM II I Ml II II I II Ml MM hi II II Mill I MM III hi III 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 24 0 

Qy 24 0 NSGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEK 299 

= M 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 

Db 241 TTGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EK 2 92 

Qy 3 00 NCCVRQLYIDFRKDLGWKWI HEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 3 59 

1 1 M M 1 1 1 1 1 1 1 1 M M 1 1 M II M II I II M M M I ! M 1 1 II II 1 1 M I M 1 1 1 M I 

Db 2 93 NCCVRQLYIDFRPCDLGWKWIHEPKGYHANFCLGPCPYIWSLDTOYSKVLALYNQHNPGAS 3S2 

Qy 3 60 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 3 98 

II M I II M M II II II 1 1 1 1 M II II M M II II II M 

Db 353 AAPCCVPQALEPLPIVYYVGRKPKVEQLSNM I VRSCKCS 391 



RESULT 8 

US-09-756-283A-23 

; Sequence 23, Application US/09756283A 



Patent No. US20020151478A1 
GENERAL INFORMATION: 
APPLICANT: Chernajovsky, Yuti 
APPLICANT: Dreja, Hanna Stina 
APPLICANT: Adams, Gillian 
TITLE OF INVENTION: Latent Fusion Protein 
FILE REFERENCE: 0623.1000000 
CURRENT APPLICATION NUMBER: US/ 09/7 5 6 , 283A 
CURRENT FILING DATE: 2001-01-09 
NUMBER OF SEQ ID NOS : 100 
SOFTWARE: Patentln version 3.0 
SEQ ID NO 23 
LENGTH: 3 90 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-756-283A-23 

Query Match 90.4%; Score 1910; DB 10; Length 390; 

Best Local Similarity 91.2%; Pred . No. 2.7e-168; 

Matches 363; Conservative 10; Mismatches 17; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IIIIIIIIIIMIIIIIIIIII IMMMIIIIIMMIMIIIIIIIIIIIIIIII 
Db 1 MP PSGLRLLPLLL PLLWLLVLTPG PPAAGLSTCKT I DMELVKRKR I EAI RGQI LS KLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I I I I I I : I I I I II I I I I I I I I I I I I I I I I I I I lllllllllllllllllllllh 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHHEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

UNI : II II II | II 1 1 !l 1 1 1 1 1 1 1! 1 1 II I ! !l II ' I II I I I I I I h I II 

Db 121 YDKFKQSTHSTYMFFNISELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

IIIIIIIIIIIIIIIIIMIIIIIIIIh M I llll II I IIHIII hi III 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

: 1 1 1 II 1 1 1 1 1 M II III I III I, 1 1 1 III 1 1 IMIIMIilllll.il Ml 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 2 92 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

MMM II , M 1 1 M II MM 1 M I II 1 1 1 II II 1 1 1 MM IIMM MMM I II II II MM II 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

1 1 1 1 1 1 M 1 1 1 1 II I M 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 M 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 9 
US-09-214-592-28 

; Sequence 28, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 



; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
FILE REFERENCE : 11060 

CURRENT APPLICATION NUMBER: US/09/214 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 28 
LENGTH: 3 90 
TYPE : PRT 
ORGANISM: canine 
US-09-214-592-28 

Query Match 89.8%; Score 1898; DB 11; Length 3 90; 

Best Local Similarity 90.7%; Pred. No. 3.5e-167 ; 

Matches 361; Conservative 12; Mismatches 17; Indels 8; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 6 0 

I illllMIIIMII 1 1 1 1 1 1 1 ! 1 1 1 ! 1 1 1 1 ( I ! 1 1 [ 1 1 1 ( I ! 1 1 1 1 1 1 1 M 1 1 1 1 1 = 

MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLS 6 0 

SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllhlll lllllllllllllllllllll lllllllllllllllllllllh hi 
SPPSQGEVPPVPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 12 0 

YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I UlhlllllllllllllllllMIIIIIIIIIIIII IIMIMIIIIIMM 
YEKVKKSPHSI YMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 18 0 

YLSNRLLAP SDS PEWLS FDVTGWRQWLTRREA I EGFRLSAHSS SDS KDNTLHVE INGFN 24 0 
IIIIMIIIIhlllMIIIIIMIIIh :|MIIIII I M I I I I I hlllh 



Qy 


l 


Db 


l 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


293 


Qy 


361 


Db 


353 



-EKN 292 



I 1 1 I ! 1 1 ' i ' , , ' i ' 1 1 1 1 ' , I ' 1 , 1 1 1 1 1 I I ' 1 ! i ! III.'! I ! ! 1 1 1 1 

CCVRQLYIDFRKDLGW?CWIHEPKGYHANFCLGPCPYIWSLDTQYSKVTALYNQHNPGASi 
APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

1 1 1 ; 1 1 1 1 1 ii iii : 1 1 , n i : 1 1 1 1 1 1 h h 1 1 1 1 m 



RESULT 10 
US-09-214-592-20 

; Sequence 20, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj i 

; APPLICANT: Sato , CYasuf umi 



; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/214 , 592A 

; CURRENT FILING DATE : 1999-01-18 

; NUMBER OF SEQ ID NOS : 34 

; SOFTWARE : 

; SEQ ID NO 20 

; LENGTH: 390 

TYPE : PRT 

ORGANISM: murine 
US-09-214-592-20 

Query Match 87.0%; Score 1840; DB 11; Length 390; 

Best Local Similarity 86.9%; Pred. No. 8.3e-162; 

Matches 346; Conservative 15; Mismatches 29; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I I II 1 1 i II 1 1 1 II IIIIIIIIIIIMIIIMMIIIIIIIIIIMMIIIMIIMI 

Db 1 MPPSGLRLLPLLLPLPWLLVLT PGR PAAGLSTCKT I DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

lllllhlllllllllllllllllllllllll :MIMMIMIIIIIIIII|: I I 
Db 61 S P PS QGE VP PG PL P EAVLAL YNS TRDR VAG ESAD P E P E P EAD YYAKE VTR VLMVDRNNA I 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

hi I Ihll IMhMMIMI MIIIIIM III 1 1 ! I !, 1 1 1 1 1 1 1 :! 1 1 

Db 121 YEKTKDISHSI YMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

II MM h ! I M 1 1 1 1 1 1 1 , 1 1 1 1 1 : : MMI Ml I Mill IMIMI : 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGIQGFRFSAHCSCDSKDNKLHVEINGIS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

Mill Ml MMMMIMMMMMMIIMIMIMIIMM III 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 2 92 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

MMMMMMMMMMMMMMMMMIMMIMMMMMIIIMIMM 
Db 2 93 CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 3 52 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MMIMIMMMIMIMMMIMMMMIIMI 

Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 11 
US-09-214-592-23 

; Sequence 23, Application US/09214592A 

; Publication No. US200300272 18A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKen j i 

; APPLICANT: Sa to , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 



; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
; CURRENT FILING DATE : 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 23 

LENGTH: 390 

TYPE: PRT 

ORGANISM: rat 
US-09-214-592-23 

Query Match 87.0%; Score 184 0; DB 11; Length 3 90; 

Best Local Similarity 86.9%; Pred. No. 8.3e-162; 

Matches 346; Conservative 14; Mismatches 30; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I IMIIMIMMI I I I II II I II II I I II II II I II II II I I I II II I I I II II II I 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllhlllMIIIIIIMMIMIIIIIII Ullllllllllllllllllh II 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I Ihll MlhMIMMI MINIM, I I IIIIIIIIIMIhlll 

Db 121 YDKTKDITHSI YMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

I I MM IMMI IMM llllll III : : |:||| III I IMM I II MM 

Db 181 YLGNRLLT PTDT P EWLS FD VTG WRQWLNQGDG I QGFR FS AHCS CDS KDN VLH VE I NG I S 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

Mill III I II II 1 1 1 II ! M! 1 1 1 M 1 1 1 1 1 1 MM I M I M Ml Ml 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 2 92 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

I I I MM Ml I II Ml 1 1 1 1 M I M 1 1 MM 1 1 n MMMM IMM MM i ; 1 1 M M M MMM 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYPiANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 3 98 

M I II IM IMM 1 1 IMM I II M MM 1 1 1 ' 1 M 1 1 1 

Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 12 
US-09-214-592-25 

; Sequence 25, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

; APPLICANT: Shibata , CKenj l 

; APPLICANT: Sa to , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACT I VI T Y - REGULATI NG COMPOUNDS 
; FILE REFERENCE: 1106 0 



CURRENT APPLICATION NUMBER: US/09/2 14 , 592A 
CURRENT FILING DATE: 1999-01-18 
NUMBER OF SEQ ID NOS : 34 
SOFTWARE : 
SEQ ID NO 25 
LENGTH: 315 
TYPE : PRT 
ORGANISM: bovine 
US-09-214-592-25 

Query Match 74.6%; Score 1577; DB 11; Length 315; 

Best Local Similarity 91.3%; Pred. No. 1.4e-137; 

Matches 295; Conservative 9; Mismatches 11; Indels 8; Gaps 1; 
Qy 7 6 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLF 13 5 

h I M M M II 1 1 II II I IMMMIIIMIIMMII Ihllll I = Ihll I 

Db 1 AI LALYNSTRDRVAGESAETE PEPEADYYAKEVTRVLMVEYGNKI YDKMKSSSHS I YMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

IIIMIIIIIIIIIIIIh:||IIIIIMIII!lllllllhll!IIIIIIIIMIIIII 
Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 120 

Qy 196 LSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNR 255 

u 1 1 1 . 1 1 1 1 ii 1 1 1 1 1 1 1 1 ■ 1 1 , i Mini ' n 1 1 hi : 1 1 ii 1 1 1 ii 1 1 1 ! 

Db 121 LSFDVTGWRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180 

Qy 2 56 PFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLG 315 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 ! 1 1 I 1 1 i 1 1 I 1 I 1 1 1 I I M 1 1 I I ' M I 

Db 181 PFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKNCCVRQLYIDFRKDLG 232 

Qy 316 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 375 

IIIIIMIIIIMIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 233 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 2 92 

Qy 376 YYVGRKPKVEQLSNMI VRSCKCS 398 

Mlllillllllllllllltlll 
Db 2 93 YYVGRKPKVEQLSNMI VRSCKCS 315 



RESULT 13 
US-09-756-283A-20 

; Sequence 20, Application US/09756283A 

; Patent No. US20020151478A1 

; GENERAL INFORMATION: 

; APPLICANT: Cherna j ovsky , Yuti 

APPLICANT: Drej a , Hanna St ma 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION: Latent Fusion Protein 

FILE REFERENCE: 0623.1000000 
; CURRENT APPLICATION NUMBER: US/09/756 , 283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS: 100 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 20 
LENGTH: 455 
TYPE: PRT 

ORGANISM: Artificial Sequence 



FEATURE : 

OTHER INFORMATION: LAP -ml FN construct 
US-09-756-283A-20 



Query Match 59.7%; Score 1262.5; DB 10; Length 455; 

Best Local Similarity 90.5%; Pred. No. 3.1e-108; 

Matches 248; Conservative 10; Mismatches 15; Indels 1; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLA 60 

i 1 1 1 , 1 1 1 ! i : i , 1 1 1 1 1 , 1 1 ! i ii i m 1 1 1 1 ii 1 1 u i i i m ! : 1 1 , 1 1 1 m : 1 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGPPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVXALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

IMIMMMMMMMMMMMMMII lllllllllllllllllllllh hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLL-RLICLKVEQHVELYQKYSNDSW 179 

Mill : Ihll IIIIIIIIIIIIIIIIIIIIIMI 1 I I I I ' I I I ' I ! I I I h 
Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSW 18 0 

Qy 180 RYLSNRLLAPSDS PEWLS FDVTGVVRQWLTRREAI EGFRLS AHS S SDS KDNTLHVE I NGF 239 

IIIIIMIIIIIIIIIMIIIIIIMIIhl MINIMI I Ihllll hllll 

Db 181 RYLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGF 24 0 

Qy 240 NSGRRGDLATI HGMNRPFLLLMATPLERAQHLHS 273 

M M II I M 1 1 1 M I M M I M M M I M M I 

Db 241 TTGRRGDLATI HGMNRPFLLLMATPLERAQHLQS 274 



RESULT 14 
US-09-756-283A-22 

; Sequence 22, Application US/09756283A 
; Patent No. US20020151478A1 
; GENERAL INFORMATION: 

APPLICANT: Cherna j ovsky , Yuti 
; APPLICANT: Dreja, Hanna Stina 
; APPLICANT: Adams, Gillian 
; TITLE OF INVENTION: Latent Fusion Protein 
; FILE REFERENCE: 0623.1000000 
; CURRENT APPLICATION NUMBER: US/09/756,283A 
; CURRENT FILING DATE: 2001-01-09 
; NUMBER OF SEQ ID NOS : 100 

SOFTWARE: Patentln version 3.0 
; SEQ ID NO 22 
LENGTH: 447 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

Other INFORMATION: ml FN -LAP construct 
US-09-756-283A-22 

Query Match 53.9%; Score 1139; DB 10; Length 447; 

Best Local Similarity 87.0%; Pred. No. 8.2e-97; 

Matches 228; Conservative 10; Mismatches 18; Indels 6; Gaps 3; 
Qy 14 PL-LWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGP 72 

m ii i ii 'in 1 1 ii i mi i , 1 1 1 ; li , i ii , i! ':i 1 1 1 h 1 1 iii 



Db 190 PLGLW AGGGSAAALSTCKTI DMELVKRKRI EAI RGQILSKLRLASPPSQGEVPPGP 245 

Qy 73 LPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLY 132 

II III II I Mill MM I II III Mill II II I llllllllh Mill III IMI 

Db 246 LPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHWEIYDKFKQSTHSIY 305 

Qy 133 MLFNTSELREAVPEPVLLSRAELRLL-RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSD 191 

i 1 1 1 1 1 1 ( ( 1 1 1 1 r r 1 1 1 ri i r 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 m 1 1 1 h 1 1 1 1 f 1 1 1 1 1 1 1 1 1 

Db 306 MFFNTSELREAVPEPVLLSRAELRLLRRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSD 365 

Qy 192 SPEWLSFDVTGVVRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIH 251 

IMIIIIIMIMIMMI Illllllll I Ihllll hllll MIMIIIIM 

Db 366 SPEWLSFDVTGVVRQWLSRGGEI EGFRLSAHCSCDSRDNTLQVDINGFTTGRRGDLATI H 42 5 

Qy 252 GMNRPFLLLMATPLERAQHLHS 273 

I II I M 1 1 1 1 M II 1 1 1 1 M I 

Db 426 GMNR PFLLLMATPLERAQHLQS 44 7 



RESULT 15 
US-09-214-592-32 

; Sequence 32, Application US/09214592A 

; Publication No. US20030027218A1 

; GENERAL INFORMATION: 

; APPLICANT: Yamasaki , CMotoo 

/ APPLICANT: Shibata , CKenj i 

; APPLICANT: Sato , CYasuf umi 

; TITLE OF INVENTION: PEPTIDES WHICH PROMOTE ACTIVATION OF LATENT TGF- AND 
METHOD 

; TITLE OF INVENTION: OF SCREENING TGF- ACTIVITY -REGULATING COMPOUNDS 
; FILE REFERENCE: 11060 

; CURRENT APPLICATION NUMBER: US/09/214 , 592A 
; CURRENT FILING DATE: 1999-01-18 
; NUMBER OF SEQ ID NOS : 34 
; SOFTWARE : 
; SEQ ID NO 32 

LENGTH: 373 

TYPE : PRT 

ORGANISM: chicken 
US-09-214-592-32 



Query Match 50.3%; Score 1063.5; DB 11; Length 373; 

Best Local Similarity 54.9%; Pred. No. 6.2e-90; 

Matches 209; Conservative 51; Mismatches 100; Indels 21; Gaps 8; 
Qy 3 0 LSTCKTIDMELVPCRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 8 9 

Mlh MM h 1 1 II h M M I II I II Ml : II 1 1 h I 111111- = 

Db 2 LSTCQRLDLEAAKKKRIEAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTOELLK 61 

Qy 90 GES - VEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPV 148 

: : | |: :|;|||: |: | : : : :: ||:: :|| | | | 
Db 62 QRARLRPPPDGPDEYWAKELRRI PMETTWDGAMEHWQPQSHSI FFVFNVSRARRG-GRPT 120 



Qy 14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 203 

M llllhll I M Mill I I Mill I : : MMMM 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 180 



Qy 204 VRQWLTRREAIEGFRLSAHSSSD SKDNTLHVEINGFNSGRRGDLATIHGMNR- -PFL 258 

I llh I : h|| I : : = I || :||h =1 : I h = 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMR I S I EGFEQ - QRGDMQS I AKKHRRVP YV 239 

Qy 259 LLMATPLERAQHLHSSRHRRALDTNYCFS-STDYKDDDDKEKNCCVRQLYIDFRKDLGWK 317 

I II I 1 1 1 I II llhlll II IMIMI MINIMI II 

Db 240 LAMALPAERANELHSARRRRDLDTDYCFGPGTD EKNCCVRPLYI DFRKDLQWK 292 

Qy 318 WIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYY 377 

Ml Mill MIMMMMM II II M M M II M M I II M M M II MM II MM I 

Db 293 WIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLPIIYY 352 

Qy 378 VGRKPKVEQLSNM I VRS CKCS 3 98 

III MIMIM'MMMM 

Db 353 VGRNVRVEQLSNMWRACKCS 373 



Search completed: October 28, 2003, 09:28:00 
Job time : 25.1473 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 28, 2003, 00:37:16 ; Search time 13.1517 Seconds 

(without alignments) 
2910.285 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-372E-13 
2114 

1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNMI VRSCKCS 398 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



283308 seqs, 96168682 residues 



Total number of hits satisfying chosen parameters: 



283308 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : PIR_76:* 
1: pirl:* 
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 



A27512 

transforming growth factor beta-1 precursor - pig 
N;Alternate names: TGF-beta 

C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 05-Jun-1988 #sequence_revision 05-Jun-1988 #text_change 16-Jul-1999 

C;Accession: A27512; A26356; 146657 

R;Derynck, R.; Rhee, L. 

Nucleic Acids Res. 15, 3187, 1987 

A;Title: Sequence of the porcine transforming growth factor-beta precursor. 
A;Reference number: A27512; MUID : 87174844 ; PMID: 3470708 
A;Accession: A27512 
A /Molecule type: mRNA 
A;Residues: 1-390 <DER> 

R;Cheifetz, S.; VJeatherbee, J. A.; Tsang, M.L.S.; Anderson, J.K.; Mole, J.E.; 
Lucas, R. ; Massague, J. 
Cell 48, 409-415, 1987 

A; Title: The transforming growth factor-beta system, a complex pattern of cross - 
reactive ligands and receptors. 

A;Reference number: A90890; MUID : 87102890 ; PMID:2879635 
A; Accession : A26356 
A;Molecule type: protein 
A;Residues: 279-322 <CHE> 

R;Kondaiah, P.; Van Obberghen-Schilling, E.; Ludwig, R . L . ; Dhar, R.; Sporn, 
M.B. ; Roberts, A . B . 

J. Biol. Chem. 263, 18313-18317, 1988 

A;Title: cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 
Evidence for alternate splicing and polyadenylation . 
A;Reference number: 146657; MUID : 89054010 ; PMID:2461367 
A /Accession : 14 6657 

A/Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-390 <K0N> 

A; Cross-references : GB:M23703; NID:g755044 ; PIDN : AAA64616 . 1 ; PID:g755045 
C;Genetics : 

A; Gene: TGFB ; TGF -beta-1 
C; Superfamily : inhibin 
C; Keywords: growth factor 

Query Match 96.4%; Score 2038; DB 2; Length 390; 

Best Local Similarity 97.2%; Pred. No. 8e-155; 

Matches 387; Conservative 0; Mismatches 3; Indels 8; Gaps 1; 

1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I i 1 1 1 , 1 1 .! i 1 i , , i 1 ; 1 1 1 1 1 1 1 1 1 , 1 1 , i 1 1 1 1 1 1 1 1 1 1 ; 1 1 1 1 i 1 1 1 , 

1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 
61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

MIIMIIMIIIIIIMMIMMlllllllllllilillllllMMMMMIMII 

61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 
121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

MIMIIMIIMIIIIIIIIMIIMIIIMIIIIIIIIIIIIIIIIIIIMIIMIII 

121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 
181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

IIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIII I lllllllllllllll 

181 YLSNRLLAPSDS PEWLSFDVTGWRQWLTRR EAI EGFRLSAHCSCDSKDNTLHVE INGFN 24 0 



Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

i ;i i in 1 1 1 in i ii 1 1 1 1 ii 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1: ii i ii ii M : in 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

M 1 1 1 1 1! i M II II M M M 1 1 1 M 1 1 M II I M M i M ( M I M 1 1 1 M I M 1 1 M 1 1 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

1 1 II Ml 1 1 1 Ii, 1 1 1 1 1 1 , 1 1 1 II I '1 1 1! 1 1 1 1 1 1 1 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 2 
S01413 

transforming growth factor beta-1 precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 30-Jun-1989 #sequence_revision 30-Jun-1989 #text_change 24-Nov-1999 
C; Access ion : SO 14 13 

R;Jakowlew ; S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A . B . 
Nucleic Acids Res. 16, 8730, 1988 

A; Title: Nucleotide sequence of chicken transforming growth factor-beta 1 (TGF- 
beta 1) . 

A;Reference number: S01413; MUID : 88335639 ; PMID:3166520 
A; Accession : SO 14 13 
A;Molecule type: DNA 
A/Residues: 1-391 <JAK> 

A; Cross -references : EMBL:X12373; NID:g63808; PIDN : CAA30933 . 1 ; PID:g63809 
C; Superf amily : mhibin 
C; Keywords: growth factor 

Query Match 94.9%; Score 2006.5; DB 2; Length 391; 

Best Local Similarity 96.2%; Pred. No. 2.6e-152; 

Matches 384; Conservative 0; Mismatches 6; Indels 9; Gaps 2; 

.PSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

Ml I Ml ii 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1, 1 h MM II 1 1 1 1 M I II 

'PSGPGLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 
•PSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

M MM M I Ml Ml I II 1 1 III I II Ml II Ml I II I MM M II IMIM I MM I M 

•PSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 
IKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

M M M MIMI M II II I II III II MM I Ml I II MM IMI IMI! 1 1 MM I 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


240 


Db 


241 



I III 1 II MM MINI III III inn milium I M 



Qy 



300 



NCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGAS 359 



I M II II 1 1 1 1 II M II 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 M 1 1 M 1 1 1 1 1 II I II 1 1 II II I 

Db 2 93 NCCVRQLYI DFRKJ3LGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGAS 352 

Qy 3 60 AAPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 98 

I IN M 1 1 1 1 1 1 1 1 1 1 1 1. 1 1 1 1 1 II I ! 1 1 1 11 1 II : 1 1 

Db 353 AAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 391 



RESULT 3 
146463 

transforming growth factor beta-1 - sheep 

C;Species: Ovis orientalis aries, Ovis ammon aries (domestic sheep) 

C;Date: 19-Dec-1997 #sequence_revision 19-Dec-1997 #text__change 24-Nov-1999 

C; Accession: 146463; S45115 

R;Woodall, C.J.; McLaren, L.J.; Watt, N.J. 

Gene 150, 371-373, 1994 

A;Title: Sequence and chromosomal localisation of the gene encoding ovine latent 
transforming growth factor-beta 1. 

A/Reference number: 146463; MUID : 95121932 ; PMID:7821809 
A; Accession: 14 64 63 

A;Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 1-390 <WOO> 

A; Cross-references : EMBL:X76916; NID:g496648; PIDN : CAA54242 . 1 ; PID:g496649 
A; Note: submitted to the EMBL Data Library, December 1993 
C; Super family : inhibin 

Query Match 91.5%; Score 1934; DB 2; Length 390; 

Best Local Similarity 91.7%; Pred. No. 1.6e-146; 

Matches 365; Conservative 10; Mismatches 15; indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

i ; i i 

Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTI DMELVKRKGI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

1 1! 1 1 1 1 1 1 1 1 1 1 1 1 h.'ll 1 1 1 II 1 1 '1 1 1 1 I MM MMMMIIMM Ihl 

Db 61 SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Ml I : I h 1 1 MIMM 1 1 III Ml 1 1 ; MMM MM IMMM I II I MMMM 

Db 121 YDKMKSSSHSI YMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 M 1 1 1 1 1! 1 1 1 1 MMMM I MMM 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

i i Mi i i M i i i i i i i i i i i M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 III 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 36 0 

1 1 MMMM M MM! MMIMu Mi i M M i M IM! MM IMMMM M Mi MMM 

Db 293 CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 98 

iiiiiiiiiiiiiiiiiiiiiiiiiiiniiiiiiiii 



Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 4 
WFHU2 

transforming growth factor beta-1 precursor [validated] - human 
N; Alternate names: growth- inhibitory factor; TGF type 2; TGF-beta 
C; Species: Homo sapiens (man) 

C;Date: 28-Feb-1986 #sequence_revision 19-Oct-1995 #text_change 08-Dec-2000 
C;Accession: A27513; A01395; A22290; 159664; S53444 
R;Derynck, R. ; Rhee, L. ; Chen, E.Y.; Van Tilburg, A. 
Nucleic Acids Res. 15, 3188-3189, 1987 

A; Title: Intron-exon structure of the human transforming growth factor-beta 
precursor gene. 

A/Reference number: A27513; MUID: 87174845 ; PMID: 3470709 
A;Accession : A27513 
A; Molecule type: DNA 
A; Residues : 1-390 <DER> 

A; Cross-references: GB:X05839; GB:Y00112; NID:g37097; PIDN : CAA29283 . 1 ; 
PID:gl212989 

R;Derynck, R. ; Jarrett , J. A.; Chen, E.Y.; Eaton, D.H.; Bell, J.R.; Assoian, 
R.K.; Roberts, A.B.; Sporn, M.B.; Goeddel , D.V. 
Nature 316, 701-705, 1985 

A; Title: Human transforming growth factor-beta complementary DNA sequence and 
expression in normal and transformed cells. 
A;Reference number: A01395; MUID: 85296301 ; PMID : 3 861940 
A; Access ion : AO 13 95 
A; Molecule type: mRNA 

A; Residues: 1-9, 'P' ,11-24, ' P' ,26-159, f R f ,160-3 9 0 <DE2> 

A; Cross-references: GB:X02812; GB:J05114; NID:g37092; PIDN : CAA26580 . 1 ; 
PID:g37093 

A;Note: the authors suggest that residues 8-23 could represent the hydrophobic 

core of an amino-terminal signal peptide 

R;Massague, J.; Like, B. 

J. Biol. Chem. 260, 2636-2645, 1985 

A;Title: Cellular receptors for type beta transforming growth factor. Ligand 

binding and affinity labeling in human and rodent cell lines. 

A;Reference number: A22290; MUID : 85131019 ; PMID-.2982829 

A; Access ion: A222 90 

A; Molecule type: protein 

A;Residues: 279-295, 'XX' , 298-301 <MAS> 

R;Urushizaki , Y . ; Niitsu, Y. ; Terui, T. ; Koshida, Y. ; Mahara, K. ; Kohgo, Y. ; 
Urushizaki, I.; Takahashi, Y . ; I to, H. 
Tumor Res. 22, 41-55, 1987 

A; Title: Cloning and expression of the gene for human transforming growth 
factor-beta in Escherichia coli. 
A; Reference number: 159664 
A; Access ion : 159664 

A; Status : preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A;Residues: 279-390 <RES> 

A; Cross-references: GB:M38449; NID:g339557; PIDN : AAA36735 . 1 ; PID:g339558 
R;Stam, K. ; Stewart, A. A. ; Qu, G.Y.; Iwata, K.K.; Fenyoe, D. ; Chait, B.T.; 
Marshak, D.R.; Haley, J.D. 
Biochem. J. 305, 87-92, 1995 

A;Title: Physical and biological characterization of a growth- inhibitory 
activity purified from the neuroepithelioma cell line A673 . 



A; Reference number: S53444; MUID : 95126934 ; PMID:7826358 
A; Access ion: S53444 
A; Status : preliminary 
A; Molecule type: protein 
A;Residues: 279-297 <STA> 

C; Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Genetics : 

A; Gene: GDB : TGFB1 ; TGFB 

A; Cross -references : GDB: 120729; OMIM: 190180 
A ; Map position: 19ql3 . 2 -19ql3 . 2 
C; Superf amily : mhibin 

C; Keywords: glycoprotein; growth factor; homodimer; mitogen; transformation 
F ; 1-18/ Doma in : signal sequence #status predicted <SIG> 
F; 19 -278/Domain : propeptide #status predicted <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F; 27 9 -3 90/Product : transforming growth factor beta-1 #status experimental <MAT> 
F; 82 , 136, 176/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 9 0.9%; Score 1921; DB 1; Length 390; 

Best Local Similarity 91.7%; Pred . No. 1.7e-145; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKT1DMELVKRKRI EAIRGQILSKLRLA 60 

I Ml Ml I I I I I I . I I I I I I I I I 1 J J i I I I I I I I I I I ! I ! I I ( I I ! I I I I I ! I I I I 
Db 1 MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

IMIIMIIilllllllllllllllllMMI ' 1 1 1 1 1 1 'M 1 II : II ! 1 1 II |: hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

Mill : Ihll I II I I I M I I I I 'I I I I I I I I I I I I I II , III Ml! I I I I I: I II 
Db 121 YDKFKQSTHS I YMFFNTSELREAVPEP VT.LSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = I 1 1 1 1 1 1 1 1 1 I MMIM MINI 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I Ml 
Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPY I WSLDTQYSKVLALYNQHNPGASA 360 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2 93 CCVRQLY I DFRKDLGWKWI HEPKGYHANFCLGPCPY I WSLDTQYSKVLALYNQHNPGASA 3 52 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

I I f I I I I 1 I I I t I I I I I I I I I I I I I I I I I I I f I I I I I I 
Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT 5 
A26960 

transforming growth factor beta-1 precursor - green monkey 
C; Species: Cercopithecus aethiops (green monkey, grivet) 



C;Date: 05-Oct-1988 #sequence_revision 05-Oct-1988 #text_change 24-Nov-1999 
C; Access ion: A26960 

R; Sharpies , K. ; Plowman, G.D.; Rose, T.M.; Twardzik, D.R.; Purchio, A.F. 
DNA 6, 239-244, 1987 

A;Title: Cloning and sequence analysis of simian transforming growth factor-beta 
cDNA. 

A;Reference number: A26960; MUID : 87246074 ; PMID:3474130 
A; Accession: A2 6 9 6 0 
A; Molecule type: mRNA 
A;Residues: 1-390 <SHA> 

A; Cross -references : GB:M16658; NID:gl76552; PIDN : AAA35369 . 1 ; PID:gl76553 
C ; Superf amily : inhibin 
C; Keywords : growth factor 

F ; 1 - 16 /Domain : signal sequence #status predicted <SIG> 

F; 17-390/ Product : transforming growth factor beta #status predicted <MAT> 

Query Match 90.8%; Score 1919; DB 2; Length 390; 

Best Local Similarity 91.5%; Pred. No. 2.5e-145; 

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

I 1 1 II I! II ! I M 1 1 1 M I M I I M II 1 1 II II 1 1 1 1 II II II M llllllllll 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKTIDMELVKRKRIETIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I I I I I : 1 1 I ' M 'M i Mllllll IIIMM M 1 1 1. 1 1 1 .. I ! I I I I ! ,: 1 1 1 ' h hi 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 

Mill : IHI 1 1 1 1 1 1 1 1 1 1 1 1 1 1 IN 1 1! 1 1 1 1 M ','1 1 1 1 1 1 ! 1 1 M !l hi M 

Db 121 YDKFKQSTHSI YMFFNTSELREAVPEPVTiLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

MlllllllhlllMIIIIIMIIIIhl IMIIIIII I IIIMM 

Db 181 YLSNRLLAPSNS PEWLS FDVTG WRQWLSRGGE I EGFRLS AHCSCDS KDNTLQVDI NGFT 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

MIIMIIIIIIIIMIIMMIIIMMII 1 1 II 1 1 II I II II 1 1 1 1 III 

Db 241 TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 r 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPI VYYVGRKPKVEQLSNMT VRSCKCS 398 

1 1 1 . i ! 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 i i ; 1 1 1 . . 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 6 
JC4023 

transforming growth factor beta-1 - dog 
C; Species: Canis lupus familiaris (dog) 

C;Date: 13-Jun-1995 #sequence_revis ion 14-Jul-1995 #text_change 24-Nov-1999 
C; Accession: JC4 023 

R; Manning, A.M.; Auchampach, J. A. ; Drong, R.F.; Slightom, J.L. 
Gene 155, 3 07-308, 1995 



A; Title: Cloning of a canine cDNA homologous to the human transforming growth 
factor-beta 1 -encoding gene. 

A;Reference number: JC4023; MUID : 9523763 0 ; PMID:7721110 
A; Access ion : JC4 023 
A;Molecule type: mRNA 
A;Residues: 1-390 <MAN> 

A; Cross-references: GB:L34956; NID:g516071; PIDN : AAA51458 . 1 ; PID:g516072 

C; Comment: This factor plays a multifunctional role as a regulator of mammalian 

cell growth and as a modulator of immune responses. 

C;Genetics : 

A; Gene: tgf-betal 

C; Superf amily : inhibin 

C;Keywords: growth factor; transforming protein 

F;288-390/Product : transforming growth factor beta 1 #status predicted <MAT> 

Query Match 89.8%; Score 1898; DB 2; Length 390; 

Best Local Similarity 90.7%; Pred. No. 1.2e-143; 

Matches 361; Conservative 12; Mismatches 17; Indels 8; Gaps 1; 

MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I llllllllllllll I ll'llllll II llh I I'l IIIMIIIIIII III Mlllh 

MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLS 6 0 



Qy 


i 


Db 


i 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


293 


Qy 


361 


Db 


353 



lllllhlll IIIIIIIMIIIIIIIIIIII I li 1 1 II II I II 1 1 II I 1 II i h hi 



120 



YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I H 'h 1 1 1 '1 1 1 1 1 1 II I IM 1 1 1 1 1 1 II I I'l II I il 1, 1 1 1 1 1 1 1 1 ! Ml 

YEKVKKSPHSIYMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 180 

YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

I'l II | I hlMh 

YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

I I I I I I I I I I I I M h I II I I ; I II I I I I I I I I IIIIMIIIIIII III 
SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRRALDTNYCFSST EKN 2 92 

CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 60 

I II i; I' ' I , l 1 I I 1 I I ! , 

CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 
APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

I I ;i 1 1 1 1 1 1 1 1 1 1 m 1 1 1 M 1 1: i ii 1 1 1 ii 1 1 m i 

APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 



RESULT / 
WFMS2 

transforming growth factor beta-1 precursor - mouse 
N; Alternate names: TGF type 2; TGF-beta 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Dec-1986 #sequence_revision 04-Dec-1986 #text_change 24-Nov-1999 
C;Accession: A01396 

R,-Derynck, R. ; Jarrett, J.A. ; Chen, E.Y.; Goeddel , D.V. 
J. Biol. Chem. 261, 4377-4379, 1986 



A;Ticle*. The murine transforming growth factor-beta precursor. 
A/Reference number: A01396; MUID : 86168 12 9 ; PMID:3007454 
A /Access ion : AO 13 9 6 
A; Molecule type: mRNA 
A;ResidueS: 1-390 <DER> 

A; Cross-references : GB:M13177; NID:g201952 ; PIDN: AAA40423 . 1 ; PID:g201953 
A;Note: the authors suggest that residues 8-23 could represent the hydrophobic 
core of an amino-terminal signal peptide 

C; Comment: The mature protein is the carboxyl -terminal segment of a precursor 
polypeptide; the active molecule is a dimer of identical polypeptide chains 
linked by an interchain disulfide bond. 
C; Superfamily : inhibin 

C;Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen; 
transformation 

F; 1 -23 /Domain : signal sequence #status predicted <SIG> 
F;24-278/Domain: propeptide #status predicted <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F;279-390/Product : transforming growth factor beta-1 #status predicted <MAT> 
F; 82 , 136 , 17 6 /Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 87.0%; Score 1840; DB 1; Length 390; 

Best Local Similarity 86.9%; Pred. No. 5e-139; 

Matches 346; Conservative 15; Mismatches 29; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I 1 1 ! i 1 1 1 1 1 il 1 1 II M Ml I M I M M II III I IIMM I M 1 1 M M IMM I II 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

II I MM M I M M 1 1 MMI I II 1 1 1 1 M M M M M MM 1 1 1 M 1 1 1 1 II 1 1 M II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I M! lllhMIIIIII Mill I Ml llllllllllllhlll 

Db 121 YEKTKDISHSI YMFFNTSDIREAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

II 1 1 1 1 M i M M ■ I M I MM M MM : = hill III I Mill M MM 

Db 181 YLGNRLLTPTDTPEWLSFDVTG'VWQWLNQGDGIQGFRFSAHCSCDSKDNKLHVEINGIS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

Mill III II II Ml MM 1 1 1 1 M 1 1 Ml 1 1 MMM II 1 1 M II I III 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 292 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 60 

1 1 MM 1 1 1 1 MM II I M 1 1 II Ml II MM 1 1 M ! II M I M M II II I M M IM IM II 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy Jbl APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 8 
S10219 

transforming growth factor beta-1 precursor - rat 
N; Alternate names: TGF type 2; TGF-beta 



C; Species: Rattus norvegicus (Norway rat) 

C;Date: 12-Feb-1993 #sequence__revision 12-Feb-1993 #text_change 24-Nov-1999 
C;Accession: S10219; PT0023; S02267 

R;Qian, S.W.; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Nucleic Acids Res. 18, 3059, 1990 

A/Title: cDNA cloning by PCR of rat transforming growth factor beta-1. 
A;Reference number: S10219; MUID : 90272425 ; PMID:2349108 
A /Access ion : SI 02 19 
A; Molecule type: mRNA 
A;Residues: 1-390 <QIA> 

A; Cross-references: EMBL:X52498; NID:g57341; PIDN:CAA36741.1; PID:g57342 
R;Okada, F . ; Yamaguchi, K. ; Ichihara, A.; Nakamura, T. 
J. Biochem. 106, 304-310, 1989 

A/Title: Purification and structural analysis of a latent form of transforming 
growth factor-beta from rat platelets. 

A/Reference number: PT0023; MUID : 90036779 ; PMID:2478527 
A; Accession: PT0023 
A;Molecule type: protein 

A; Residues: 30-32, 'X 1 , 34-38, 'Q' ,40-42, 'X',44 <OKA> 
R;Okada, F. ; Yamaguchi, K. ; Ichihara, A.; Nakamura, T. 
FEBS Lett. 242, 240-244, 1989 

A;Title: One of two subunits of masking protein in latent TGF-beta is a part of 
pro-TGF-beta . 

A/Reference number: S02267; MUID: 89121078 ; PMID:2914605 

A; Access ion: S02267 

A; Molecule type: protein 

A;ResidueS: 30-32, 'X' ,34-38, 'Q', 40-42, 'X' ,44 <OK2> 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; integrin binding 
F; 1-29/Domain : signal sequence #status predicted <SIG> 
F;30-278/Domain: propeptide #status experimental <PRO> 
F;244-246/Region: cell attachment (R-G-D) motif 

F ; 27 9 - 3 9 0/Product : transforming growth factor beta-1 #status predicted <MAT> 
F;82,136, 176/Bmding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 87.0%; Score 1840; DB 2; Length 390; 

Best Local Similarity 86.9%; Pred. No. 5e-139; 

Matches 346; Conservative 14; Mismatches 30; Indels 8; Gaps 1; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQILSKLRLA 60 

I lllllllllllll I I I I I I I I [ ' i I I I ! I I I I I I I I I ! I I I I , I I Ill 

Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllhlllllllllllllllllllllllll MIIIIIMIIIIIIIIIIIh I I 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVTiMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

ill i ihii Mih: I Mill l lllllllll I I 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 

Db 121 YDKTKDITHS I YMFFNTSDI REAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

I I IMI hhlllllllllllllll : : hill III I Mill IMIMI 
Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGIQGFRFSAHCSCDSKDNVLHVEINGIS 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

Mill III 1 1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 III 



Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 2 92 



Qy 



301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 



Db 



2 93 CC VRQL Y I DFR KDLGW KW I H E PKG Y HANF CLG PC P Y I WS LDTQ YS KVLAL YNQHN PGASA 352 



Qy 



3 61 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 



Db 




RESULT 9 
A40057 

transforming growth factor beta-1 precursor - bovine (fragment) 

N;Alternate names: beta-TGF; cartilage-inducing factor-A; EGF-dependent TGF or 

dEGF; MGF-b; milk growth factor b; TGF-type II 

C; Species: Bos primigenius taurus (cattle) 

C;Date: 28-Feb-1992 #sequence_revision 28-Feb-1992 #text_change 16-Jul-1999 
C;Accession: A40057; A42320; A05284; A24322; B61439 

R;Van Obberghen-Schill ing, E. ; Kondaiah, P.; Ludwig, R.L.; Sporn, M . B . ; Baker, 
CC. 

Mol. Endocrinol. 1, 693-698, 1987 

A; Title: Complementary deoxyribonucleic acid cloning of bovine transforming 
growth f actor-betal . 

A; Reference number: A40057; MUID: 91042552 ; PMID:3153459 
A/Accession : A40057 
A; Molecule type: mRNA 
A;Residues: 1-315 <VAN> 

A; Cross-references : GB:M36271; NID:gl63747; PIDN : AAA30778 . 1 ; PID:gl63748 
R;Ogawa, Y.; Schmidt, D.K.; Dasch, J.R.; Chang, R.J.; Glaser, C.B. 
J. Biol. Chem. 267, 2325-2328, 1992 

A;Title: Purification and characterization of transforming growth f actor-beta2 . 3 

and -betal.2 heterodimers from bovine bone. 

A/Reference number: A42320; MUID: 92129307 ; PMID: 1733936 

A/Accession : A42320 

A;Molecule type: protein 

A/Residues: 204-209, 'X» , 211-217 <OGA> 

R;Roberts, A.B.; Anzano, M.A. ; Meyers, C.A. / Wideman, J. ; Blacher, R. ; Pan, 
Y.C.E.; Stein, S . ; Lehrman, S.R.; Smith, J.M.; Lamb, L.C.; Sporn, M.B. 
Biochemistry 22, 5692-5698, 1983 

A; Title: Purification and properties of a type beta transforming growth factor 
from bovine kidney. 

A;Reference number: A05284; MUID: 84104793 ; PMID:6607069 

A /Access ion : AO 52 8 4 

A; Molecule type: protein 

A;Residues: 204-218 <ROB> 

R;Seyedin, S.M.; Thompson, A.Y.; Bentz, H.; Rosen, D.M.; McPherson, J.M.; Conti, 
A. ; Siegel, N.R. ; Galluppi, G.R.; Piez, K. A . 
J. Biol. Chem. 261, 5693-5695, 1986 

A ; Title: Cartilage- inducing factor-A. Apparent identity to transforming growth 
factor-beta . 

A/Reference number: A24322; MUID : 86195954 ; PMID: 3754555 
A; Accession : A24 322 
A/Molecule type: protein 
A;ResidueS: 204-233 <SEY> 

R;Jin, Y. ; Cox , D.A.; Knecht , R. ; Raschdorf, F . ; Cerletti, N. 
J. Protein Chem. 10, 565-575, 1991 



A;Title-. Separation, purification, and sequence identification of TGF-betal and 
TGF-beta2 from bovine milk. 

A;Reference number: A61439; MUID : 92 189724 ; PMID: 1799413 
A; Accession: B61439 
A;Molecule type: protein 

A;Residues: 204-209, 'X' ,211-217, 'XX' ,220-232 <JIN> 

C; Comment: This polypeptide is composed of two polypeptide chains cross -linked 
by disulfide bonds. It has been found in neoplastic and non-neoplast ic tissues. 
C;Comment: Type II TGF does not bind to the EGF receptor and lacks intrinsic 
mitogenic activity, but in soft agar, it reacts synergistically with either type 
I TFG or EGF, and induces cell proliferation. Cells grown in monolayer do not 
respond in a similar manner to these growth factors, but morphologically do 
acquire a transformed phenotype. 
C ; Superf ami ly : inhibin 

C; Keywords: glycoprotein; growth factor; heterodimer 

F;204-315/Product : transforming growth factor beta-1 #status experimental <MAT> 
F ; 7 , 61 , 1 01/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 74.6%; Score 1577; DB 2; Length 315; 

Best Local Similarity 91.3%; Pred. No. 3.5e-118; 

Matches 295; Conservative 9; Mismatches 11; Indels 8; Gaps 1; 

Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 13 5 

I : I M I I I ' I I I I I ' I | I ! M I II ' II I II I I I I ' 'I I ' - ' I I I I : ||:|| | 
Db 1 AILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKIYDKMKSSSHSIYMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

lllllllllllllllll|::|l!lllllllllllllllll|:|IIIIIIIIIIIIIIIM 
Db 61 NTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 120 

Qy 196 LSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNR 255 

1 1 1 1 1 1 ii 1 1 : 1 1 1 1 1 1 iiiiiiiii i I'M 1 h mi h iiim mi i in 

Db 121 LSFDVTGWRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180 

Qy 2 56 PFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLG 315 

I III Ml 1 1 M Ml MM II I M M I IM II 1 1 1 , MMMMMMMIMI 

Db 181 PFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKNCCVRQLYI DFRKDLG 232 

Qy 316 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 375 

1 1 1 Ml 1 1 II M II M III 1 1 1 II II M I II M II M 1 1 Ml 1 1 1 1 MM I II I M MM 

Db 233 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 2 92 

Qy 3 76 YYVGRKPKVEQLSNMIVRSCKCS 3 98 

MMMMMMMMMMMI 

Db 2 93 YYVGRKPKVEQLSNMIVRSCKCS 315 



RESULT 10 
A4I916 

transforming growth factor beta-4 precursor - chicken (fragment) 
N;Alternate names: TGF-beta 4 
C;Species: Gallus gallus (chicken) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 16-Jul-1999 

C;Accession: A41918; A34941; S03110 

R;Burt, D.W.; Jakowlew, S.B. 

Mol . Endocrinol. 6, 989-992, 1992 



A;Title: Correction: a new interpretation of a chicken transforming growth 
factor-beta 4 complementary DNA . 

A; Reference number: A41918; MUID : 92357039 ; PMID: 1353860 
A; Accession : A41918 
A; Molecule type: mRNA 
A;Residues: 1-373 <BUR> 

A; Cross-references : GB:M31160; GB:X08012; GB:S41706; NID : gl262437 ; 
PIDN:AAB05637 . 1 ; PID:gl262438 

A;Note: sequence extracted from NCBI backbone (NCBIN: 110186, NCBIP : 110187 ) 
A;Note: this report corrects and reinterprets the sequence from reference A34941 
R;Jakowlew, S.B.; Dillard, P.J.; Sporn, M.B.; Roberts, A . B . 
Mol. Endocrinol. 2, 1186-1195, 1988 

A;Title: Complementary deoxyribonucleic acid cloning of a messenger ribonucleic 
acid encoding transforming growth factor beta-4 from chicken embryo 
chondrocytes . 

A;Reference number: A34941; MUID: 89112198 ; PMID:2464131 
A /Accession: A3 4 941 
A /Molecule type: mRNA 

A; Residues : ' MDPMSIGPKSCGGSPWRPPGTAPWSIGSRRATASSSCSTSSRVRAEVGGRAL ' , 122- 

209, 'D' , 211-373 <JAK> 

A; Cross-references : EMBL:X08012 

A;Note: this sequence has been corrected in A41918 
C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor 

F;l/Domain: signal sequence (fragment) #status predicted <SIG> 
F;223-225/Region: cell attachment (R-G-D) motif 

F;260-373/Product : transforming growth factor beta-4 #status predicted <MAT> 
F; 54 , 109 , 153/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 50.3%; Score 1063.5; DB 2; Length 373; 

Best Local Similarity 54.9%; Pred. No. 4.1e-77; 

Matches 209; Conservative 51; Mismatches 100; Indels 21; Gaps 8; 

Qy 3 0 LSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 8 9 

1111= MM hllllhllllllllll HI II llh I 111111- = 

Db 2 LSTCQRLDLEAAKKKRI EAVRGQI LSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK. 61 

Qy 9 0 GES - VEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPV 14 8 

: : | |: :|:|||: |: | : = : =: ||:: :|| | | | 
Db 62 QRARLRPPPDGPDEYWAKELRRIPMETTWDGAMEHWQPQSHSIFFVFNVSRARRG-GRPT 120 

Qy 14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 2 03 

II IIMIMI I II Ml, I I Mill I : : MINIM 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 18 0 

Qy 2 04 VRQWLTRREAI EGFRLSAHSSSD S KDNTLH VE I NGFNSGRRGDLATI HGMNR - - PFL 2 58 

I MM I : MM I : -III MM' : I M h : 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRI S I EGFEQ - QRGDMQS I AKKHRR VPYV 239 

Qy 259 LLMATPLERAQHLHSSRHRRALDTNYCFS-STDYKDDDDKEKNCCVRQLYIDFRKDLGWK 317 

I II I III II IM II 1 1 Ml II M III MM II II II Ml II 

Db 240 LAMALPAERANELHSARRRRDLDTDYCFGPGTD EKNCCVRPLYIDFRKDLQWK 292 

Qy 318 WIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYY 377 

MMMM 1 1 1 1 = 1 1 1 1 ! 1 1 ( II 1 1 M M II M II 1 1 1 1 1 M II M II MMIMM 

Db 2 93 WIHEPKGYMANFCMGPCPYIWSADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLPIIYY 352 



Qy 3 78 VGRKPKVEQLSNMI VRSCKCS 3 98 

III : I I ! I I I h I h ' I I I 
Db 3 53 VGRNVRVEQLSNMWRACKCS 373 



RESULT 11 
A34939 

transforming growth factor beta -3 precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 13-Jul-1990 #sequence_revision 13-Jul-1990 #text_change 16-Jul-1999 
C;Accession: A34939; S25850; S36125; S36124; 151181 

R/Jakowlew, S.B.; Dillard, P.J.; Kondaiah, P.; Sporn, M.B.; Roberts, A . B . 
Mol. Endocrinol. 2, 747-755, 1988 

A;Title: Complementary deoxyribonucleic acid cloning of a novel transforming 
growth factor-beta messenger ribonucleic acid from chick embryo chondrocytes. 
A;Reference number: A34939; MUID : 89096966 ; PMID:3211158 
A /Access ion: A3 4 93 9 

A;Status: preliminary; not compared with conceptual translation 
A /Molecule type: mRNA 
A /Residues: 1-412 <JAK> 

A/Cross-references: GB:M31154; NID:g212758; PIDN : AAA49089 . 1 ; PID:g212759 
R/Burt, D.W.; Paton, I.R.; Dey, B.R. 
J. Mol. Endocrinol. 7, 175-183, 1991 

A/Title: Comparative analysis of human and chicken transforming growth factor- 
beta-2 and -beta-3 promoters . 

A/Reference number: S25850; MUID : 92134496 / PMID: 1840616 
A /Access ion : S258 50 
A; Status : prel iminary 
A; Molecule type: DNA 
A/Residues : 1-117 <BUR> 

A/Cross-references: EMBL: X58 127 / NID:g63815; PIDN : CAA4 1128 . 1 ; PID:g63816 
A /Access ion : S3 6 125 

A/Status : preliminary/ nucleic acid sequence not shown; translation not shown 
A/Molecule type: DNA 
A/Residues : 119-172 <BU2> 

A/ Cross -references : EMBL:X60055; NID:g396688; PIDN : CAA42653 . 1 / PID:g396689 
A/Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1991 

A /Access ion: S3 6 124 

A/Status : preliminary/ nucleic acid sequence not shown/ translation not shown 
A/ Molecule type: DNA 

A/Residues: 173 -322 , 1 ELPT ', 327-4 12 <BU3> 
A/ Cross -references : EMBL: X6 00 91 

A/Note: the nucleotide sequence was submitted to the EMBL Data Library, June 
1991 

R/Jakowlew, S.B.; Lechleider, R. ; Geiser, A.G. / Kim, S.J./ Santa-Coloma , T.A. ; 
Cubert, J.; Sporn, M.B.; Roberts, A.B. 
Mol. Endocrinol. 6, 1285-1298, 1992 

A/Title: identitication and characterization of the chicken transforming growth 
factor-beta 3 promoter. 

A;Reference number: 151181; MUID : 93024487 ; PMID: 1406706 
A; Accession : 151181 
A; Status : prel iminary 
A;Molecule type: DNA 
A/Residues: 1-117 <JA2> 

A/ Cross-references : GB:S46000; NID:g257172/ PIDN : AAB23575 . 1 / PID:g257173 
C; Genetics : 



A;Introns: 216/1; 252/1; 309/2; 360/3 
A;Note: list of introns may be incomplete 
C; Superfamily : inhibin 

C;Keywords: glycoprotein; growth factor; growth regulation; homodimer; mitogen 
F; 1 -22 /Domain : signal sequence #status predicted <SIG> 
F; 23-300/Domain: propeptide #status predicted <PRO> 

F; 3 01 -4 12 /Product : transforming growth factor beta-3 #status predicted <MAT> 
F ; 74 , 142 /Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 41.4%; Score 876; DB 2; Length 412; 

Best Local Similarity 47.1%; Pred. No. 4.3e-62; 

Matches 198; Conservative 56; Mismatches 114; Indels 52; Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :|= : Ihl I : I : I = I = I h I I I I I I I I I I I I I III I I 

Db 9 LVLLSLLSFATVSLALSSCTTLDLEHI KKKRVEAI RGQI LSKLRLTSPPE - - SVGPAHVP 66 

Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVE SGNQIYDKF 124 

11111111= : I :| I I hMIMh = h= I- 

Db 67 YQILALYNSTRELL- -EEMEEEKEESCSQENTESEYYAKEIHKFDMIQGLPEHNELGICP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND 177 

II : = : II I I I III h I I : = II Mhl I 

Db 125 KGVTSNVFR - FNVS SAEKNSTNLFRAEFRVLRVPNPSSKRSEQRI ELFQ I LRPDEHI 180 

Qy 178 -SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKD 229 

I I I I I : IIIIIMIII 11 = 11 Ih I :| I 

Db 181 AKQRYLSGRNVQTRGSPEWLSFDVTDTVREWLLHRESNLGLEISIHCPCHTFQPNGDILE 240 

Qy 230 N TLHVEINGFNSG RRGDLATI HGMNRPFLLLMATPLERAQH- -LHSSRHRR 278 

I I =: I :| I I I I = - I hll I h I I : I 

Db 241 NLHE VLE I KFKG I DS EDDYGRGDLGRLKKQKDLHNPHL I LMMLP PHRLES PTLGGQRKKR 300 

Qy 279 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 338 

MINIM = hlllll 1 1 1 1 1 1 - 1 1 1 1 1 1 • 1 1 1 1 1 1 II II llllh 

Db 301 ALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYFANFCSGPCPYLR 352 

Qy 339 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

I II =1 II III II llhllllll I I I I hlllll I I I I I I I I I : I : I I I I I 
Db 353 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 



RESULT 12 
B61036 

transforming growth factor beta -5 precursor - African clawed frog 
C; Species: Xenopus laevis (African clawed frog) 

C;Date: 31-Dec-1993 #sequence_revision 03-Feb-1994 #text_change 16-Jul-1999 
C;Accession: A34929; B61036 

R;Kondaiah, P.; Sands, M.J.; Smith, J.M. ; Fields, A.; Roberts, A.B.; Sporn, 
M.B.; Melton, D.A. 

J. Biol. Chem. 265, 1089-1093, 1990 

A;Title: Identification of a novel transforming growth factor-beta (TGF-beta5) 
mRNA in Xenopus laevis. 

A; Reference number: A34929; MUID : 90110090 ; PMID:2295601 
A; Accession : A34 92 9 
A; Molecule type: mRNA 
A;Residues: 1-382 <KON> 



A; Cross -references : GB:J05180; NID:g214821; PIDN : AAA49968 . 1 ; PID:g214822 
R;Roberts, A.B.; Rosa, F. ; Roche, N.S.; Coligan, J.E.; Garfield, M . ; Rebbert , 
M.L.; Kondaiah, P.; Danielpour, D, ; Kehrl, J.H.; Wahl , S.M.; Dawid, I.B.; Sporn, 
M.B. 

Growth Factors 2, 135-147, 1990 

A;Title: Isolation and characterization of TGF-beta2 and TGF-beta5 from medium 
conditioned by Xenopus XTC cells. 

A;Reference number: A61036; MUID : 90253806 ; PMID:2340184 
A; Access ion : B61036 
A;Molecule type: protein 

A; Residues: 271-276, 'X', 278-284, • XX ' , 2 87-299 <R0B> 
C; Super family: inhibin 
C;Keywords: growth factor 

F;271-382/Product : transforming growth factor beta-5 #status experimental <MAT> 

Query Match 41.3%; Score 874; DB 2; Length 382; 

Best Local Similarity 46.9%; Pred. No. 5.6e-62; 

Matches 190; Conservative 55; Mismatches 122; Indels 38; Gaps 11; 

Qy 9 LPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLAS PPSQGDV 68 

= : I M I M I I Mill MM h::|lllllllllll|:| I I 

Db 1 MEV1jWMLLVLLVLHLSSLAMSLSTCI<AVDMEEVRKRRI EAI RGQI LSKLKLDKTPDV- DS 59 

Qy 69 PPGPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDK 123 

: I Mill : : - I = III I Ml I II - I 

Db 60 EKMTVPSEAI FLYNSTLEVI REKATREEEHVGHDQNI QDYYAKQVYR FES I TELED - 115 

Qy 124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL- -KVEQHVELYQKYSNDSW- - 179 

I M I Ml I II MM-' : : ::| :||: || : 

Db 116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 17 0 

Qy 180 -RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDNTLHVE 235 

III - : I IMMIII I Ml I I I I I : Ml 
Db 171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 

Qy 236 INGFNSGRRGDLATIHGM- -NRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKD 293 

I II Mill:: :|:|:: : | || : ||| :| : ||| : 
Db 227 I EGF - PALRGDLASLSSKENTKPYLMI TSMPAERI DTVTSSRKKRGVGQEYCFGNNG 282 

Qy 2 94 DDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQ 3 53 

MM: M 1 M M M 1 1 1 M I M II I MMM II 1 1 II M I II I II I M II I 

Db 283 PNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYSKVLSLYNQ 337 

Qy 354 HNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 398 

MMM MMM I M 1 1 1 : 1 1 1 1 1 MIMIMMMI II 

Db 338 NNPGASISPCCVPDVLEPLPIIYYVGRTAKVEQLSNMWRSCNCS 382 



RESULT 1J 
S01825 

transforming growth factor beta-3 precursor - pig 
C; Species: Sus scrofa domestica (domestic pig) 

C;Date: 30-Sep-1989 #sequence_revis ion 30-Sep-1989 #text_change 16-Jul-1999 
C; Access ion: SO 1825 

R;Derynck, R. ; Lindquist, P.B.; Lee, A.; Wen, D. ; Tamm, J.; Graycar, J.L.; Rhee, 
L.; Mason, A.J.; Miller, D.A. ; Coffey, R.J.; Moses, H.L.; Chen, E.Y. 
EMBO J. 7, 3737-3743, 1988 



A;Title: A new type of transforming growth factor-beta, TGF-beta3. 
A;Reference number: S01824; MUID : 89091120 ; PMID:3208746 
A; Accession : SO 18 2 5 
A /Molecule type: mRNA 
A;Residues: 1-409 <DER> 

A/Cross-references: EMBL:X14150; NID:g2127; PIDN : CAA32363 . 1 ; PID:g2128 
C; Superfamily : inhibin 
C; Keywords : growth factor 

F ; 1 - 2 5 / Doma in : signal sequence #status predicted <SIG> 
F;26-297/Domain: propeptide #status predicted <PRO> 

F; 2 98 -4 0 9 /Product : transforming growth factor beta-3 #status predicted <MAT> 

Query Match 39.7%; Score 838.5; DB 2; Length 409; 

Best Local Similarity 45.6%; Pred. No. 4.1e-59; 

Matches 190; Conservative 58; Mismatches 122; Indels 47; Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLASPPSQGDVPPGPLP 74 

h I M : :||| hi : M II M I II II I I I I I I II III : 

Db 7 LWLALLNFATVSLSMSTCTTLDFDH I KRKR VEAI RGQ I LS KLRLTS P PDPSML - - AN I P 64 

Qy 75 EAVLALYNSTR DRVAGES VE - - PEPEPEADYYAKEVTRVLMV ESGNQI YDKFKG 126 

II I I I I I I ' I II : = hMlllh = h I I = II 
Db 6 5 TQVLDLYNSTRELLEEVHGERGDDCTQENTESEYYAKEI YKFDMI QGLEEHNDLAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:= II I : = I III I : I h = II = I h I I 

Db 125 ITSKI FR- FNVSSVEK NETNLFRAEFRVLRMPNPSSKRSEQRI ELFQILQPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDS PEWLSFDVTGWRQWLTRREAI EGFRLSAH SSSDSKDN- 230 

I I • : I : MINIM MMI llh I Ml = I M 

Db 181 QRY I DGKNLPTRGAAEWLS FD VTDTVREWLLRRESNLGLE I S I HCPCHTFQPNGD I LEN I 240 

Qy 231 - -TLHVEINGFNS GRRGDLATIHGM- -NRPFLLLMATPLERAQH- -LHSSRHRRALD 281 

= == I M MM = : I IMI I M : I = I MMI 

Db 241 QE VME I KFKG VDS EDD PGRGDLGRLKKKKEHS PHL I LMM I P PDRLDNPGLGAQRKKRALD 300 

Qy 282 TNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKJDLGWKWIHEPKGYHANFCLGPCPYIWSLD 341 

Mill : imi IMMhIIIMIMMIIIMMI llllh I I 

Db 3 01 TNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSAD 3 52 

Qy 342 TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

I M II Ml II II IMI MM MM hi II II II III llh hill II 

Db 3 53 TTHSSVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTAKVEQLSNMWKSCKCS 40 9 



RESULT 14 
A41397 

transforming growth factor beta-3 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 03-Apr-1992 #sequence_revision 03-Apr-1992 #text_change 16-Jul-19?9 
C;Accession: A41397; A61039; A61225 

R;Miller, D.A.; Lee, A.; Matsui, Y.; Chen, E.Y.; Moses, H.L.; Derynck, R. 
Mol. Endocrinol. 3, 1926-1934, 1989 

A; Title: Complementary DNA cloning of the murine transforming growth factor- 
beta3 (TGFbeta3) precursor and the comparative expression of TGFbeta3 and 
TGFbetal messenger RNA in murine embryos and adult tissues. 
A;Reference number: A41397; MUID : 90190650 ; PMID:2628730 



A ; Access ion : A4 1 3 9 7 
A; Molecule type: mRNA 
A/Residues: 1-410 <MIL> 

A;Cross-references : GB:M32745; NID:g201949; PIDN : AAA4 0422 . 1 ; PID:g201950 
R;Denhez, F. ; Lafyatis, R . ; Kondaiah, P.; Roberts, A.B.; Sporn, M.B. 
Growth Factors 3, 139-146, 1990 

A; Title: Cloning by polymerase chain reaction of a new mouse TGF-beta, mTGF- 
beta3 . 

A/Reference number: A61039; MUID: 91000714 ; PMID:2206556 
A /Access ion: A6103 9 
A; Molecule type: mRNA 
A;Residues: 1-410 <DEN> 

R;Watrin, F. / Scotto, L. ; Assoian, R.K.; Wolgemuth, D.J. 
Cell Growth Differ. 2, 77-83, 1991 

A;Title: Cell lineage specificity of expression of the murine transforming 

growth factor beta-3 and transforming growth factor beta-1 genes. 

A/Reference number: A61225; MUID : 91299576 ; PMID:2069871 

A; Access ion : A6122 5 

A; Status : translation not shown 

A; Molecule type: mRNA 

A;Residues: 285-410 <WAT> 

C; Superf amily : inhibin 

C; Keywords: glycoprotein; growth factor; growth regulation 
F ; 1 -2 1 /Domain : signal sequence #status predicted <SIG> 
F; 22-298/ Doma in : propeptide #status predicted <PRO> 
F;259-261/Region: cell attachment (R-G-D) motif 

F;299-410/Product : transforming growth factor beta-3 #status predicted <MAT> 
F; 72 , 133 , 140/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 3 9.4%; Score 832; DB 2 ; Length 410; 

Best Local Similarity 45.5%; Pred. No. 1.4e-58; 

Matches 190; Conservative 58; Mismatches 122; Indels 48; Gaps 14, 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :| : I I I I hi MMIMIIIIIIIIIIII III = =1 

Db 7 LWLALLNLATI SLSLSTCTTLDFGHI KKKRVEAI RGQI LSKLRLTSPPEPSVMT- -HVP 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 

III II MM = : II I : I • : I I I I M = I- | = : II 

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

= : II I = : I Ml h I I : : II Mhl I 

Db 125 ITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRIELFQILRPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAH SSSDSKDN- 230 

Ih : I : M I II M I 11 = 11 llh I Ml = I M 

Db 181 QRYI GG KNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEIS I HCPCHTFQPNGDILENV 24 0 

Qy 231 - -TLHVEINGFNS GRRGDLATIHGM NRPFLLLMATPLER- -AQHLHSSRHRRAL 280 

: I - MM = = I MM I I = II MM 

Db 241 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPPHRLDSPGQGSQRKKRAL 300 

Qy 281 DTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSL 340 

MUM : 1 = 11111 MIIMMMIIIMIIIMMIII 1 1 1 1 1 ■ I 

Db 301 DTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSA 352 



Qy 341 DTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

II =| II III II llhllllll I I I I hlllll llllllllhhMIII 
Db 3 53 DTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 



RESULT 15 
A55706 

transforming growth factor beta-3 precursor - rat 
C;Species: Rattus norvegicus (Norway rat) 

C;Date: 03-Mar-1995 #sequence_revision 03-Mar-1995 #text_change 16-Jul-1999 
C;Accession: A55706; B40699; S36042 

R;Wang, J. ; Kul iszewski , M. ; Yee, W. ; Sedlackova, L. ; Xu, J.; Tseu, I.; Post, M. 
J • Biol. Chem. 270, 2722-2728, 1995 

A; Title: Cloning and expression of glucocorticoid- induced genes in fetal rat 

lung fibroblasts. Transforming growth factor-beta-3 . 

A /Reference number: A55706; MUID : 95155340 ; PMID:7852342 

A; Accession: A557 06 

A; Molecule type: mRNA 

A/Residues: 1-410 <WAN> 

A; Cross-references : GB :U03491 

A;Note: it is uncertain whether Met-1 is the initiator 
R;McKinnon, R.D.; Piras, G. ; Ida Jr., J. A.; Dubois -Dalcq, M . 
J. Cell Biol. 121, 1397-1407, 1993 

A;Title: A role for TGF-beta in oligodendrocyte differentiation. 

A;Reference number: A40699; MUID : 93286190 ; PMID:8509457 

A /Accession: B4 0699 

A; Status: preliminary 

A /Molecule type: mRNA 

A/Residues : 157-211 <MCK> 

A/ Cross-references : EMBL : X71903 ; NID:g311326; PIDN : CAA50722 . 1 ; PID:g311327 
C; Superfamily : inhibin 

Query Match 39.2%; Score 828/ DB 2 ; Length 410/ 

Best Local Similarity 45.2%; Pred. No. 2.9e-58; 

Matches 189; Conservative 58; Mismatches 123; Indels 48/ Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRI EAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I =| = I I I I hi : h I h I I I I Ml I I I I I Ml : M 

Db 7 LWLALLNLATVSLSLSTCTTLDFGH I KKKR VEA I RGQ I LS KLRLTS P PEPS VMT - - HVP 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 

I I I I J I I I I = : M I : MMIIIh : h= | = : II 

Db 65 YQVTALYNSTRELLEEMHGEREEGCTQETSESEYYAKE I HKFDM I QG LAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

-III:: I III hlh : II MIM I 

Db 125 ITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQILRPDEHIAK 180 

Uy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 

IM : I : M MIM IMM llh I M I : | M 

Db 181 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 240 

Qy 231 - -TLHVEINGFNS GRRGDLATIHGM NRPFLLLMATPLER- -AQHLHSSRHRRAL 280 

■ - I - I I I I : = I MM I I = I MM 

Db 241 HEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMI PPHRLDSPGQGGQRKKRAL 300 

Qy 281 DTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSL 34 0 



Db 

Qy 
Db 



301 



341 



llllll : 
DTNYCFRNL- 



hlllll MIMhMIIMMIIIIhMII llllh I 
-EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSS 



352 



398 



DTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 

II :| II III II llhllllll MM I : I I I I I I I I I I I I I I U = I I I I I 
3 53 DTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 
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OM protein - protein search, using sw model 
Run on: 



Title: 
Perfect score: 2114 



October 28, 2003, 00:08:41 ; Search time 7.97725 Seconds 

(without alignments) 
2346.251 Million cell updates/sec 

US-10-017-372E-13 



Sequence : 



1 MAPSGLRLLPLLLPLLWLLV GRKPKVEQLSNMIVRSCKCS 3 98 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 127863 seqs, 47026705 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



127863 



Database 



SwissProt 41:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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P07200 sus scrofa 


2 
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P09533 cercopithec 


5 
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89 


.8 
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1 


TGFl* 


CAN FA 
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ALIGNMENTS 



RESULT 1 
TGF1_PIG 

ID TGF1_PIG STANDARD; PRT; 39 0 AA . 

AC P07200; P08832; 

DT 01-APR-1988 (Rel. 07, Created) 

DT Ol-APR-1988 (Rel. 07, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Sus scrofa (Pig) . 

OC Eukaryota,- Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
OC Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 
OX NCBI_TaxID=9823 ; 
RN [1] 



RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary ; 

RX MEDLINE=87174844; PubMed=347 0708 ; 

RA Derynck R . , Rhee L . ; 

RT "Sequence of the porcine transforming growth factor-beta precursor . " ; 

RL Nucleic Acids Res. 15:3187-3187(1987). 

RN [2] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RC STRAIN=Mmiature swine; 

RX MEDLINE=8 9054 010; PubMed=2461367 ; 

RA Kondaiah P., van Obberghen-Schill ing E. , Ludwig R.L., Dhar R. , 

RA Sporn M.B., Roberts A.B.; 

RT " cDNA cloning of porcine transforming growth factor-beta 1 mRNAs . 

RT Evidence for alternate splicing and polyadenylat ion . " ; 

RL J . Biol. Chem. 263:18313-18317(1988). 

RN [3] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RX MEDLINE=88335639; PubMed=3 16652 0 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Nucleotide sequence of chicken transforming growth factor-beta 1 

RT (TGF-beta 1) 

RL Nucleic Acids Res. 16:8730-8730(1988). 

RN [4] 

RP SHOWS THAT REF . 3 SEQUENCE IS FROM PIG. 

RA Jakowl ew S . B . ; 

RL Unpublished observations (MAR-1996) . 

RN [5] 

RP SEQUENCE FROM N.A., AND VARIANT VAL-114. 

RA Wimmers K. , Chomdej S., Ponsuksili S., Schellander K. ; 

RT "Polymorphism in the porcine transforming growth factor beta 1 

RT gene . 11 ; 

RL Submitted (DEC-2001) to the EMBL/ GenBank/DDB J databases. 

RN [6] 

RP SEQUENCE OF 279-322. 

RX MEDLINE=87102 8 90; PubMed=287963 5 ; 

RA Cheifetz S., Weatherbee J.A. , Tsang M.L.S., Anderson J.K., Mole J.E., 

RA Lucas R., Massague J.; 

RT "The transforming growth factor-beta system, a complex pattern of 

RT cross-reactive ligands and receptors."; 

RL Cell 48:409-415(1987). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC -!- CAUTION: REF . 3 SEQUENCE WHICH WAS SAID TO ORIGINATE FROM CHICKEN 
CC WHITE LEGHORN, SEEMS (REF. 4) TO ORIGINATE FROM PIG. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 



CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to 1 icense@isb-sib . ch) . 



DR EMBL; Y00111; CAA68291.1; -. 

DR EMBL; M23703; AAA64616.1; -. 

DR EMBL; X12373; CAA30933.1; -. 

DR EMBL; AF461808; AAL57902.1; 

DR PIR; A27512; A27512. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb • 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF0 068 8; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PRO 1423; TGFB ETA. 

DR ProDom; PD0 003 57; TGFb; 1. 

DR SMART; SM0 02 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA__1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal; Polymorphism. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 




FT 


PROPEP 


24 


278 






FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH 


FACTOR BETA 1 . 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 




FT 


DISULFID 


293 


356 


BY SIMILARITY. 




FT 


DISULFID 


322 


387 


BY SIMILARITY. 




FT 


DISULFID 


326 


389 


BY SIMILARITY. 




FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N- LINKED (GLCNAC. . 


. ) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 


FT 


VARIANT 


114 


114 


L -> V. 




FT 


CONFLICT 


6 


7 


LR -> PG (IN REF. 3) 




FT 


CONFLICT 


180 


180 


R -> G (IN REF. 3) . 




FT 


CONFLICT 


237 


237 


N -> NA (IN REF. 3) . 




SQ 


SEQUENCE 


390 AA; 


44294 


MW; A6E2C3659FC384E6 CRC64 ; 



Query Match 96.3%; Score 2035; DB 1; Length 390; 

Best Local Similarity 97.0%; Pred. No. 7.2e-153; 

Matches 386; Conservative 1; Mismatches 3; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 



CC 



Db 




Qy 



61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 



Db 




Qy 



121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 180 



Db 



121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 



Qy 



181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

1 1 1 1 1 1 'Mil : ii i n i ii 1 1 ;i 1 1 'i ii i ;i 1 1 1 1 1 1 ii i i 1 1 1 1 1 1: i ii 1 1 1 ii 



Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHCSCDSKDNTLHVEINGFN 240 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

IIIIMIIIMIIIIIMMIIHMIIIMIMIMMIMIIIIIM III 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 2 92 

Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

iiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiimiii 

Db 293 CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

'1 1 1 1 1 1 1 1 1 1 1 i I II I !i ! I M 1 1 1 

Db 353 APCCVPQALEPLPIVTYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 2 
TGF1_SHEEP 

ID TGF1_SHEEP STANDARD; PRT; 3 90 AA. 

AC P50414; 

DT 01-OCT-1996 (Rel. 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Ovis aries (Sheep) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla; Ruminant ia; Pecora; Bovoidea; 

OC Bovidae; Caprinae; Ovis. 

OX NCBI_TaxID=9940; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=95121932; PubMed=782 1 8 0 9 ; 

RA Woodall C.J., McLaren L.J., Watt N.J.; 

RT "Sequence and chromosomal localisation of the gene encoding ovine 

RT latent transforming growth factor-beta 1 . " ; 

RL Gene 150:371-373(1994). 

RN [2] 

RP SEQUENCE OF 281-390 FROM N.A. 

RC STRAIN=Merino ; TISSUE=Skin ; 

RX MEDLINE=95268698 ; PubMed = 774 962 1 ; 

RA Sutton R., Ward W.G. , Raphael K.A. , Cam G.R.; 

RT "Growth factor expression in skin during wool follicle development."; 

RL Comp. Biochem. Physiol. HOB: 697-705 (1995) . 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 



CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@xsb-sib . ch) . 

CC 

DR EMBL; X76916; CAA54242.1; -. 

DR EMBL; L36038; AAA31526.1; -. 

DR PIR; 146463; 146463. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 POTENTIAL. 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 

FT DISULFID 326 389 BY SIMILARITY. 

FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 82 82 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 136 136 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 176 176 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT SITE 244 246 CELL ATTACHMENT SITE (POTENTIAL) . 

SQ SEQUENCE 390 AA; 44291 MW; 1C2472 994 84D0E57 CRC64 ; 

Query Match 91.5%; Score 1934; DB 1; Length 390; 

Best Local Similarity 91.7%; Pred. No. 6.5e-145; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

i i : 1 1 1 1 1 : 1 1 1 1 1 1 ii i : , 1 1 1 1 1 1 1 1 1 ; i ii 1 1 ii i r 1 iiiiiiiiiiiiiii 

Db 1 MPPSGLRLLPLLLPLLWLLMLTPGRPVAGLSTCKTIDMELVKRKGIEAIRGQILSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllllllllllllhllltlllllllllll I lllllllllllllllllll ||:| 
Db 61 SPPSQGDVPPGPLPEAILALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYGNKI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I : Ihll , I Mil I MM !l I III I'-ll I Mil II I II II I MM I h III 
Db i2l YDKMKSSS'HS I YMFFNTSELREAVPEPVLLSRADVRLLRLKLKVEQHVELYQKYSNNSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

1 1 1 1 M 1 1 II II I MM 1 1 M II I MM I 1 1 MM, MM I 1 1 1 1 1 1 1 MM M 

Db 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTHREEIEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

M M MM ! MMM I M Mi MMIM MM MMM IM MIMi MM MM III 

Db 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 292 



Qy 301 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

M 1 1 i 1 1 h I !l I ! i: M Ml III 1 1 1 1 1 1 1 1 M 1 1 1 1 ! II 1 1 1 1 1 1 II 1 1 1 1 II I II I 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIIIMIIIMIM IMIIIIIilMIII I'M 

Db 3 53 APCCVPQALEPLP IVY YVGRKPKVEQLSNM I VRSCKCS 3 90 



RESULT 3 
TGF1_HUMAN 

ID TGF1_HUMAN STANDARD; PRT; 3 90 AA. 

AC P01137; Q9UCG4; 

DT 21-JUL-1986 (Rel . 01, Created) 

DT 01-FEB-1991 (Rel. 17, Last sequence update) 

DT 15-SEP-2003 (Rel. 42, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 OR TGFB . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBIJTaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87174845; PubMed=34 7 07 09 ; 

RA Derynck R., Rhee L. , Chen E.Y., van Tilburg A. ; 

RT "Intron-exon structure of the human transforming growth factor-beta 

RT precursor gene."; 

RL Nucleic Acids Res. 15:3188-3189(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=85296301; PubMed=3 8 6 1 94 0 ; 

RA Derynck R., Jarrett J. A. , Chen E.Y., Eaton D.H., Bell J.R., 

RA Assoian R.K., Roberts A.B., Sporn M.B., Goeddel D.V.; 

RT "Human transforming growth factor-beta complementary DNA sequence and 

RT expression in normal and transformed cells."; 

RL Nature 316:701-705(1985). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Duodenum, and Eye; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B . , Buetow K.H. , Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M. , Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Browns tern M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J. , Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J. , Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S w Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E. , Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 



RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A . , Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences . " ; 

RL Proc. Natl. Acad. Sci . U.S.A. 99:16899-16903(2002). 

RN [4] 

RP SEQUENCE OF 279-390 FROM N.A. 

RC TISSUE=Carcinoma ; 

RA Urushizaki Y . , Niitsu Y., Terui T. # Koshida Y . , Mahara K. , Kohgo Y., 

RA Urushizaki I., Takahashi Y. , Ito H.; 

RT "Cloning and expression of the gene for human transforming growth 

RT factor-beta in Escherichia coli."; 

RL Tumor Res. 22:41-55(1987). 

RN [5] 

RP SEQUENCE OF 279-329. 

RC TISSUE=Bladder carcinoma; 

RX MEDLINE=93229900; PubMed=84 7 184 6 ; 

RA Bourdrel L. , Lin C.-H., Lauren S.L., Elmore R.H., Sugarman B.J., 

RA Hu S., Westcott K.R.; 

RT "Recombinant human transforming growth factor-beta 1: expression by 

RT Chinese hamster ovary cells, isolation, and characterization."; 

RL Protein Expr. Purif. 4:130-140(1993). 

RN [6] 

RP SEQUENCE OF 279-301. 

RX MEDLINE=85131019; PubMed=2982829 ; 

RA Massague J., Like B . ; 

RT "Cellular receptors for type beta transforming growth factor. Ligand 

RT binding and affinity labeling in human and rodent cell lines."; 

RL J. Biol. Chem. 260:2636-2645(1985). 

RN [7] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=93144319; PubMed=8424 942 ; 

RA Archer S. J. , Bax A., Roberts A.B., Sporn M.B., Ogawa Y., Piez K.A. , 

RA Weatherbee J. A. , Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D. A. ; 

RT "Transforming growth factor beta 1: NMR signal assignments of the 

RT recombinant protein expressed and isotopically enriched using Chinese 

RT hamster ovary cells."; 

RL Biochemistry 32:1152-1163(1993). 

RN [8] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=93144320; PubMed=8424 943 ; 

RA Archer S.J., Bax A . , Roberts A.B., Sporn M.B., Ogawa Y., Piez K.A. , 

RA Weatherbee J. A. , Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D.A. ; 

RT "Transforming growth factor beta 1: secondary structure as determined 

RT by heteronuclear magnetic resonance spectroscopy."; 

RL Biochemistry 32:1164-1171 (1993) . 

RN [ 9 ] 

RP STRUCTURE BY NMR OF 279-390. 

RX MEDLINE=96266150; PubMed=86796 13 ; 

RA Hinck A. P., Archer S.J., Qian S.W., Roberts A.B., Sporn M.B., 

RA Weatherbee J. A., Tsang M.L.-S., Lucas R. , Zheng B.-L., Wenker J., 

RA Torchia D.A. ; 

RT "Transforming growth factor beta 1: three-dimensional structure in 

RT solution and comparison with the X-ray structure of transforming 

RT growth factor beta 2 . " ; 



RL Biochemistry 35:8517-8534(1996). 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 
CC SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X05839; CAA29283.1; 

DR EMBL; X05840; CAA29283.1; JOINED . 

DR EMBL; X05843; CAA29283.1; JOINED. 

DR EMBL; X05844; CAA29283.1; JOINED. 

DR EMBL; X05849; CAA29283.1; JOINED. 

DR EMBL; X05850; CAA29283.1; JOINED. 

DR EMBL; X02812; CAA26580.1; ALT_SEQ . 

DR EMBL; BC001180; AAH01180.1; 

DR EMBL; BC000125; AAH00125.1; -. 

DR EMBL; BC022242; AAH22242.1; -. 

DR EMBL; M38449; AAA36735.1; -. 

DR PIR; A27513; WFHU2 . 

DR PDB; 1KLA; 17-AUG-96. 

DR PDB; 1KLC; 17-AUG-96. 

DR PDB; 1KLD; 17 -AUG- 96. 

DR Genew; HGNC : 11766 ; TGFB1 . 

DR MIM; 190180; -. 

DR GO; GO: 0006916; P : ant i -apoptosis ; TAS . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGF_TGFb. 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFbjpropept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS0025 0; TGF_BETA_1 ; 1. 

KW Growth tactor; Mitogen; Glycoprotein; Signal; 3D-structure . 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 

FT DISULFID 293 356 

FT DISULFID 322 387 

FT DISULFID 326 389 

FT DISULFID 355 355 INTERCHAIN. 



FT 


CARBOHYD 


82 


82 


N- LINKED (GLCNAC ) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC ) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC ) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT QTTF (POTFNTTAT^ 


FT 


CONFLICT 


10 


10 


L -> P (IN REF 2) 


FT 


CONFLICT 


159 


159 


R -> RR (IN REF 2) 


FT 


STRAND 


281 


281 




FT 


TURN 


282 


287 




FT 


STRAND 


294 


296 




FT 


STRAND 


300 


300 




FT 


TURN 


302 


305 




FT 


STRAND 


313 


313 




FT 


STRAND 


317 


317 




FT 


STRAND 


321 


323 




FT 


HELIX 


335 


346 




FT 


TURN 


348 


349 




FT 


STRAND 


358 


370 




FT 


TURN 


371 


372 




FT 


STRAND 


373 


387 




SQ 


SEQUENCE 


390 AA; 


44341 


MW; 75391614250288FE CRC64 ; 



Query Match 90.9%; Score 1921; DB 1; Length 39 0; 

Best Local Similarity 91.7%; Pred. No. 6.8e-144; 

Matches 365; Conservative 10; Mismatches 15; Indels 8; Gaps 1; 



Qy 


i 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 

1 lllllll ! 1 Mi 1 1 1 1 M ! 1 1 1 II ' 1 1 II 1 1 1 1 1 M 1 II II 1 II 1 1 1 1 1 1 , 1 M 1 

MPPSGLRLLLLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 


60 


Db 


l 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

IIIMhMIIMIIIIIIIIIIIIIIIIIII 1 1 1 1 1 1 1 1 1 II 1 II II 1 II 1 Ih hi 

SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVETHNEI 


120 


Db 


61 


120 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 
Mill : Ihil lllllllllllllllllllllllllllllllllllllllllhlll 


180 


Db 


121 


YDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 


180 


Qy 


181 


YLSNRLLAPSDS PEWLSFDVTGVVRQWLTRREAI EGFRLSAHSSSDSKDNTLHVE I NGFN 

lilililllllllllllliiliiiillhl 'nil : i ihiiii hill 

YLSNRLLAPSDSPEWLSFDVTGWRQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFT 


240 


Db 


181 


240 


Qy 


241 


SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 

hllllllllMIIIIIIIIIIIIMIIMI h 1 1 1 1 Ih II 1 II II III 

TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 


300 


Db 


241 


292 


Qy 


301 


CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 60 
M II 1 1 1 ! 1 1 1 1 1 1 M M 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 M 1 M 1 1 1 1 1 M M 1 1 II 1 1 1 


Db 


293 


M 1 1 M 1 1 1 1 1 [ 1 1 1 M 1 1 I 1 | | | | | I 1 I | I | 1 I ] | | | I I | | | | | | | | | | M | | | 

CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 


352 


Qy 


361 


APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

1 1 1 1 1 1 II h 1 1 1 1 II 1 1 1 1 II 1 II hi 1 II h 1 1 1! 1 

APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 




Db 


353 





RESULT 4 
TGF1_CERAE 

ID TGF1_CERAE STANDARD; PRT; 3 90 AA. 



AC P09533; 

DT 01-MAR-1989 (Rel . 10, Created) 

DT 01-MAR-1989 (Rel. 10, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Cercopithecus aethiops (Green monkey) (Grivet) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Cercopithecus . 

OX NCBI__TaxID=9534; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=87246074; PubMed=34 74 13 0 ; 

RA Sharpies K., Plowman G.D., Rose T.M. , Twardzik D.R., Purchio A.F.; 

RT "Cloning and sequence analysis of simian transforming growth 

RT factor-beta cDNA . 11 ; 

RL DNA 6:239-244 (1987) . 

CC -!- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROL PROLIFERATION, 
CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinformat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M16658; AAA35369.1; -. 

DR PIR; A26960; A26960. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR003911; TGFJTGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PRO 0438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PDO 00357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROS1TE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 16 POTENTIAL. 

FT PROPEP 17 278 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY . 

FT DISULFID 322 387 BY SIMILARITY. 



FT 


DISULFID 


7 9 
o o 


1 9, 9 
J» o > 


BY ^TMTT.APTTY 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44356 


MW; DFF63E2BAB44320E CRC64 ; 


Query Match 




90. 8 J 


i>; Score 1919; DB 1; Length 3 90; 



Best Local Similarity 91.5%; Pred. No. 9.8e-144; 

Matches 364; Conservative 10; Mismatches 16; Indels 8; Gaps 1; 



Qy 


1 


MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRLA 

1 1 1 1 III 1 i 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II li 1 1 II 1 ll 1 1 1 1 , II IIIIIIIIIMI 
MP PSGLRLLPLLLPLLWLLVLTPSRPAAGLSTCKT I DMELVKRKR I ET I RGQI LSKLRLA 


60 


Db 


l 


60 


Qy 


61 


SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 

1 1 II MM 1 1 M II 1 MM 1 1 1 M II ! M 1 1 1 llllllllllllllllllllll: |:| 

^PP^nnFVPPnPT.PFAVT.ATiYKI^TRnRVAnP^APPFPFPFAnYYAK^/TRVT.MVRTHMPT 
o t r oyuD v rrijrijr l-lc\ v x-Ltt.XJ 1 1\ 3 x r\ur\. v doalj r l r d r ZirtXJ x L2-\i\nj v x n. v xjim v d i niM c x 


120 


Db 


61 


1ZU 


Qy 


121 


YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 

Mill : Ihll 1 IMMMMMMMMMIM IMMIIIIIIIIMM II 
YDKFKQSTHS I YMFFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNNSWR 


180 


Db 


121 


180 


Qy 


181 


YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 

1 1 1 1,' II II h 1 1 1 1 1 II 1 II 1 1 1 1 1 1 h 1 IIIIIIIM 1 lllllll HIM 

YLSNRLLAPSNS PEWLS FDVTGVVRQWLSRGGE I EGFRLSAHCS CDS KDNTLQVDI NGFT 


240 


Db 


181 


240 


Qy 


241 


SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 

: 1 1 1 1 I 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 II 1 1 II M II 1 1 1 II 1 III 
TGRRGDLATIHGMNRPFLLLMATPLERAQHLQSSRHRRALDTNYCFSST EKN 


300 


Db 


241 


292 


Qy 


301 


CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 


360 


Db 


293 


M 1 II II 1 1 1 1 1 1 1 1 Ml 1 M 1 Ml M Ml 1 1 MM IIMM Ml IM 1 II Mi MM 

CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 


352 


Qy 


361 


APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

IMIIIIMMMMIIMIIIIMMIIMIIIIMI 




Db 


353 


APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 90 





RESULT 5 
TGF1_CANFA 

ID TGF1_CANFA STANDARD; PRT; 3 90 AA. 

AC P54831; 

DT 01-OCT-1996 (Rel . 34, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth tactor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Can is familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Jugular vein endothelial; 



RX MEDLINE=95237630; PubMed= 772 1110 ; 

RA Manning A.M., Auchampach J. A. , Drong R.F., Slightom J.L.; 

RT "Cloning of a canine cDNA homologous to the human transforming growth 

RT factor-beta 1- encoding gene." ; 

RL Gene 155:307-308(1995). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES . MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright- It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; L34956; AAA51458.1; -. 

DR PIR; JC4023; JC4023. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 



DR 


PRINTS; ! 


PR01423; 


TGFB ETA . 




DR 


ProDom; : 


PD000357; 


TGFb; 


1 . 




DR 


SMART; SM002 04; TGFB; 1. 






DR 


PROSITE; 


PS00250; 


TGF BETA 1; 1. 




KW 


Growth factor; Mitogen; 


Glycoprotein; Signal . 




FT 


SIGNAL 


1 


23 


POTENTIAL. 




FT 


PROPEP 


24 


278 


BY SIMILARITY. 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH 


FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 




FT 


DISULFID 


293 


356 


BY SIMILARITY. 




FT 


DISULFID 


322 


387 


BY SIMILARITY. 




FT 


DISULFID 


326 


389 


BY SIMILARITY. 




FT 


DISULFID 


355 


355 


INTERCHAIN. 




FT 


CARBOHYD 


82 


82 


N -LINKED ( GLCNAC . . 


.) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N -LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC . . 


.) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE 


(POTENTIAL) . 


SQ 


SEQUENCE 


390 AA 


; 4418 


15 MW; EB4780E88B7B590E CRC64 ; 



Query Match 89.8%; Score 1898; DB 1; Length 390; 

Best Local Similarity 90.7%; Pred. No. 4.4e-142; 

Matches 361; Conservative 12; Mismatches 17; Indels 8; Gaps 1; 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 



I 1 1 1 1 II II II II M 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 

Db 1 MPPSGLRLLPLLLPLLRLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLS 60 

Qy 61 SPPSQGDVPPGPLPEAVIALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

I I I I I I : ! I I I , II I I Ml I I I I I I i ! I I : I IN III I I I I I II I I I I I I I h hi 
Db 61 SPPSQGEVPPVPLPE^VLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVENTNKI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I UlhlllMIIIIIIIMIIIIIIIIIIIIIIIIII Ml 1 1 1 1 M I II III I 

Db 121 YEKVKKSPHS I YMLFNTSELREAVPEPVLLSRAELRLLRLKLKAEQHVELYQKYSNDSWR 180 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

I II II M M I h M 1 M 1 1 1 II I M II h =11111111 I MINN hlllh 

Db 181 YLSNRLLAPSDTPEWLSFDVTGWRQWLSHGGEVEGFRLSAHCSCDSKDNTLQVDINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

I h III I I I I ! I ll I I I ll I I I I I I I I I I I I I lllllllllllll Ml 

Db 241 SSRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRQRRALDTNYCFSST EKN 2 92 

Qy 3 01 CCWQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 360 

I : l| 1 1 1 1 1 1 1 1 1 II I II I II 1 1 1 II 1 1 1 1 1 1 1 1 1 II III 1 1 1 1 1 , 1 II 1 1 II I II INI 

Db 293 CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

MIIIIIIMIIIIIIMIIIIIIIIMIIIMIIMI 

Db 353 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 6 
TGF1_H0RSE 

ID TGF1_H0RSE STANDARD; PRT; 3 90 AA. 

AC 019011; 

DT 15-JUL-1998 (Rel . 36, Created) 

DT 15-JUL-1998 (Rel. 36, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Equus cabal lus (Horse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=97 96; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lymph node; 

RX MEDLINE=98185507; PubMed=9524819 ; 

RA Penha-Goncalves M.N., Onions D.E., Nicolson L. ; 

RT "Cloning and sequencing of equine transforming growth factor-beta 1 

RT (TGF beta-1) cDNA. " ; 

KL UNA Seq. 7:375-378(1997). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 



CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X99438; CAA678 01.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; I PRO 03911; TGFJTGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS002 50; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 278 BY SIMILARITY. 

FT CHAIN 279 390 TRANSFORMING GROWTH FACTOR BETA 1. 

FT DISULFID 285 294 BY SIMILARITY. 

FT DISULFID 293 356 BY SIMILARITY. 

FT DISULFID 322 387 BY SIMILARITY. 

FT DISULFID 326 389 BY SIMILARITY. 

FT DISULFID 355 355 INTERCHAIN (BY SIMILARITY) . 

FT CARBOHYD 82 82 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 136 136 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 176 176 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 390 AA; 43974 MW; A86D715F44549691 CRC64 ; 

Query Match 87.2%; Score 1844; DB 1; Length 390; 

Best Local Similarity 88.4%; Pred. No. 7.9e-138; 

Matches 352; Conservative 12; Mismatches 26; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I M M II I iM I M I MM 1 1 1 1 II 1 1 II M MM 1 1 Ml 1 1 1 1 II 1 1 MM II 1 1 M I 

Db 1 MPPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 6 0 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 120 

lllllhllllllllilllllllll :||||t | IIIIIIIIIIIIIMIIMI hi 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRAQVAGESAETEPEPEADYYAKEVTRVLMVEKENEI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

i = ' i mini 1 1 h i: ii i: 1 ii 1 1 1 mi i mimmimiiimm 

Db 121 YKTVETGSHSI YMFFNTSELRAAVPDPMLLSRAELRLLRLKLSVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

MMMI 1 1 1 1 1 1 II 1 1 1 If 1 1 1 1 1 1 1 : MINIMI! I lllllll I Mlh 

Db 181 YLSNRLLTPSDSPEWLSFDVTGWRQWLSQGGAMEGFRLSAHCSCDSKDNTLRVGINGFS 240 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

I MINIM 1 1 1 11 1 1 M II 1 1 M II I IIIIIIIIIMIIIIMM III 

Db 241 SSRRGDLATIDGMNRPFLLLMATPLERAQQLHSSRHRRALDTNYCFSST EKN 2 92 

Qy 3 01 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

MMMIIIIMMIIMIIMMIIIIMMIIMIIIIIIIIIMIIMIMIIMM 

Db 293 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

mm: 1 immimmiimmmmiiimimii 

Db 353 APCCVPQVLEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 7 
TGF1_M0USE 

ID TGF1_M0USE STANDARD; PRT ; 390 AA. 

AC P04202; 

DT 20-MAR-1987 (Rel. 04, Created) 

DT 20-MAR-1987 (Rel. 04, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebra ta; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCB IJTax I D= 1 0 0 9 0 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=86168129; PubMed=3 007454 ; 

RA Derynck R. , Jarrett J. A., Chen E.Y., Goeddel D.V. ; 

RT "The murine transforming growth factor-beta precursor."; 

RL J. Biol. Chem. 261:4377-4379(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RX MEDLINE=9609654 5; PubMed=8522200 ; 

RA Guron C, Sudarshan C., Raghow R. ; 

RT "Molecular organization of the gene encoding murine transforming 

RT growth factor beta 1 . " ; 

RL Gene 165:325-326(1995). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6, and NOD/LT; TISSUE=Spleen ; 

RA Poirot L., Benoist C. , Mathis D. ; 

RT "Transforming growth factor-beta 1 sequence and expression: no 

RT difference between NOD/Lt and C57B1/6 mouse strains."; 

RL Submitted (AUG-1998) to the EMBL/GenBank/DDB J databases. 

CC FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION , DIFFERENTIATION , AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul f ide-1 inked . 

CC -!- SUBCELLULAR LOCATION : Secreted. 

CC -I- SIMILARITY: Belongs to the TGF-beta family. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 



DR 


EMBL; 


M13177; 


AAA40423 


1; 




DR 


EMBL; 


L42462; 


AAB00138 


1; 




DR 


EMBL; 


L42456; 


AAB00138 


l; 


JOINED 


DR 


EMBL; 


L42457; 


AAB00138 


1; 


JOINED 


DR 


EMBL; 


L42458; 


AAB00138 


1; 


JOINED 


DR 


EMBL ; 


L42459; 


AAB00138 


1; 


JOINED 


DR 


EMBL; 


L42460; 


AAB00138 


1; 


JOINED 


DR 


EMBL; 


L42461; 


AAB00138 


1; 


JOINED 


DR 


EMBL ; 


AJ009862; CAA08900. 


1; 


DR 


PIR; 


A01396; 


WFMS2 . 






DR 


HSSP; 


P01137; 


1KLA. 







DR MGD; MGI: 98725; Tgfbl . 

DR GO; GO:0005578; C : extracel lular ma t r ix ; IDA. 

DR GO; GO: 0006954; P : inflammatory response; IMP. 

DR GO; GO:0007515; P: lymph gland development; IMP. 

DR GO; GO: 0008220; P:necrosis ; IMP. 

DR GO; GO: 0016202; P:regulation of myogenesis; IDA. 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003 911; TGFJTGFb. 

DR InterPro; IPR00183 9; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA. 

DR ProDom; PD0 00357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF__BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


SIGNAL 


1 


23 


POTENTIAL. 


FT 


PROPEP 


24 


278 




FT 


CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


285 


294 


BY SIMILARITY. 


FT 


DISULFID 


293 


356 


BY SIMILARITY. 


FT 


DISULFID 


322 


387 


BY SIMILARITY. 


FT 


DISULFID 


326 


389 


BY SIMILARITY. 


FT 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


390 AA; 


44310 


MW; 4381A51B711D689E CRC64 ; 



Query Match 8 7.0%; Score 184 0; DB 1; Length 3 90; 

Best Local Similarity 86.9%; Pred . No. 1.6e-137; 

Matches 346; Conservative 15; Mismatches 29; Indels 8; Gaps 1; 



Qy 



1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLA 60 

I lllllllllllll Ml i 1 1 1 1 ii 1 1 1 " |i| | .mil 1 1 1 1 1 1 1 II I M 1 1 II Ml 



Db 1 MPPSGLRLLPLLLPLPWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRIA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I II I I I = I I I II II I I II I I I I I I I I I I I I I I : II I I I I I I II II I I I I I I I h II 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

hi I Ihll MlhMIIIIM lllllllli III llllllllllllhlli 
Db 121 YEKTKDI SHS I YMFFNTSDI REAVPEPPLLSRAELRLQRLKSSVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDS PEWLSFDVTGVVRQWLTRREAI EGFRLSAHSSSDSKDNTLHVE I NGFN 240 

II I I II hi: I I I I I I I I I 1 I I I I I : : hill III I III Nihil 

Db 181 YLGNRLLTPTDTPEWLSFDVTGVVRQWLNQGDG I QGFRFSAHCSCDSKDNKLHVEINGIS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

Mill Ml 1 1 M M 1 1 1 II I M I II M I M I M I II 1 1 1 1 1 1 II I Ml 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 2 92 

Qy 301 CC VRQL Y I DFR KDLGWKW I HE P KG YHAN F CLG P C P Y I WS LDTQ YS KVLAL YNQHN PGAS A 360 

II II I II II II I II I M I II II I I I II I I II I I I I M I I I M II I M II I I II I I I I I II 
Db 293 CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 A PCCVPQALE PL PI VYYVGRKPKVEQLSNMI VRSCKCS 3 98 

h II I Mil! 1 1 1 1 1 M h 1 1 1 1 1 1 h 1 II 1 1 : 1 1 1 1 

Db 353 SPCCVPQALEPLP I VYYVGRKPKVEQLSNMI VRSCKCS 3 90 

RESULT 8 
TGF1_RAT 

ID TGF1_RAT STANDARD; PRT; 3 90 AA. 

AC P17246; 

DT 01-AUG-1990 (Rel. 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley ; TISSUE=Heart ; 

RX MEDLINE=90272425; PubMed=23491 08 ; 

RA Qian S.W. f Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "cDNA cloning by PCR of rat transforming growth factor beta-1."; 

RL Nucleic Acids Res. 18:3059-3059(1990). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES . MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS . 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

cc 



CC This SWISS-PROT entry is copyright- It is produced through a collaboration 
CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
CC the European Bioinformat ics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; X52498; CAA36741.1; -. 
DR PIR; S10219; S10219. 
DR HSSP; P01137; 1KLA. 
DR InterPro; IPR0024 00; GF_cysknot . 
DR InterPro; IPR003911; TGF_TGFb . 
DR InterPro; IPR001839; TGFb. 
DR InterPro; IPR001111; TGFb_N . 
DR Pfam; PF00019; TGF-beta; 1. 
DR Pfam; PF00688; TGFb_propept ide ; 1. 
DR PRINTS; PR00438; GFCYSKNOT . 
DR PRINTS; PR014 23; TGFB ETA . 
DR ProDom; PD00 0357; TGFb; 1. 
DR SMART; SM00204; TGFB; 1. 
DR PROSITE; PS00250; TGF_BETA_1 ; 1. 
KW Growth factor; Mitogen; Glycoprotein; Signal. 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



Gaps 1 ; 

Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRIA 60 

I IIIIMIIIIIII IIMIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIIIIMI 

Db 1 MPPSGLRLLPLLLPLPWLLVTiTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVTALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

lllllhlllllllllllllllllllllllll :|||IIIIMIIIIMIIIIh II 
Db 61 SPPSQGEVPPGPLPEAVLALYNSTRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI 12 0 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSFAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

III I Ihll IllhUIIIIII llllllll I I I Ml I I ,! I I I I h I I! 
Db 121 YDKTKDITHSI YMFFNTSDIREAVPEPPLLSRAELRLQRFKSTVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

II I I I I MhllMMIMMIMM = = hill III I Mill I I I I I I I 

Db 181 YLGNRLLTPTDTPEWLSFDVTGWRQWLNQGDGIQGFRFSAHCSCDSKDNVLHVEINGIS 240 



SIGNAL 


1 


23 


POTENTIAL. 


PROPEP 


24 


278 




CHAIN 


279 


390 


TRANSFORMING GROWTH FACTOR BETA 1. 


DISULFID 


285 


294 


BY SIMILARITY. 


DISULFID 


293 


356 


BY SIMILARITY. 


DISULFID 


322 


387 


BY SIMILARITY. 


DISULFID 


326 


389 


BY SIMILARITY. 


DISULFID 


355 


355 


INTERCHAIN (BY SIMILARITY) . 


CARBOHYD 


82 


82 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


136 


136 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


176 


176 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SITE 


244 


246 


CELL ATTACHMENT SITE (POTENTIAL) . 


) SEQUENCE 


390 AA; 


44329 MW; 5E2 1 1 08ED5 0D853C CRC64 ; 


Query Match 




87.0%; 


Score 1840; DB 1; Length 390; 


Best Local Similarity 


86.9%; 


Pred. No. 1.6e-137 ; 


Matches 346; 


Conservative 


14; Mismatches 30; Indels 8; 



Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 300 

Mill III IIIIIIIIIIIMIMIIIIIIIIMMIIIMMII III 

Db 241 PKRRGDLGTIHDMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKN 292 

Qy 3 01 CCWQLYIDFRPCDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 3 60 

IIIIIIIIIIIIMMIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIMIIIIMIMI 

Db 293 CCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

: 1 1 1 1 1! : 1 1 1 1 1 'i ii ' 1 , ii 1 1 m 1 1 ; , i ii ; m 1 1 

Db 353 SPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 

RESULT 9 
TGF1_CAVP0 

ID TGF1_CAVP0 STANDARD; PRT ; 390 AA . 

AC Q9Z1Y6; Q9QZB3; Q9R148; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) . 

GN TGFB1 . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI__TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Hartley; 

RA Jeevan A., McMurray D.N. , Yoshimura T.; 

RT "Guinea pig transforming growth factor-beta in peritoneal exudates 

RT after BCG vaccination."; 

RL Submitted (OCT- 1999) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 265-382 FROM N.A. 

RX MEDLINE=9914 4 670; PubMed=10 02 5978 ; 

RA Scarozza A.M., Ramsingh A.I., Wiener v., Wiener K. ; 

RT "Spontaneous cytokine gene expression in normal guinea pig blood and 

RT tissues."; 

RL Cytokine 10:851-859(1998). 

RN [3] 

RP SEQUENCE OF 279-371 FROM N.A. 

RC STRAIN=Hartley; TISSUE=Trachea ; 

RA Morishima Y., Uchida Y., Nomura A., Ishii Y., Sakamoto T. , 

RA Sekizawa K. ; 

RT "Guinea-pig transforming growth factor-beta expression in injured 

RT tracheal epithelium."; 

RL Submitted (JUL-1999) to the EMBL/GenBank/DDBJ databases. 

CC -I- FUNCTION: MULTIFUNCTIONAL PEPTIDE THAT CONTROLS PROLIFERATION, 

CC DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL TYPES. MANY 

CC CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF THEM HAVE 

CC SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES A 

CC POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS . 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked (By similarity) . 

CC -1- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Biomformatics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 















DR 


EMBL; AF191297; 


AAF02780 . 


1; - 




DR 


EMBL; AF097509; 


AAC83807. 


1; - 




DR 


EMBL; AF169347; 


AAD49347 . 


1; - 




DR 


HSSP; P01137; 1KLA. 






DR 


InterPro; 


IPR0024 00; GF_cysknot . 


DR 


InterPro; 


IPR003911; TGF 


TGFb . 


DR 


InterPro ; 


IPR001839; TGFb 






DR 


InterPro; 


IPR001111; TGFb 


_N. 




DR 


Pfam; PF00019; TGF-beta; 


1 . 




DR 


Pfam; PF00688; TGFb_propept ide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT . 




DR 


PRINTS; PR01423; 


TGFB ETA. 






DR 


ProDom; PD0003 57; TGFb; 1 






DR 


SMART ; SM00204; 


TGFB ; 1 . 






DR 


PROSITE; 


PS00250; TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


24 




POTENTIAL . 


FT 


PROPEP 


25 


278 




POTENTIAL. 


FT 


CHAIN 


279 


390 




TRANSFORMING GROWTH FACTOR BETA 1 


FT 


DISULFID 


285 


294 




BY SIMILARITY. 


FT 


DISULFID 


293 


356 




BY SIMILARITY. 


FT 


DISULFID 


322 


387 




BY SIMILARITY. 


FT 


DISULFID 


326 


389 




BY SIMILARITY. 


FT 


DISULFID 


355 


355 




INTERCHAIN (BY SIMILARITY) . 


FT 


CAR BOH YD 


82 


82 




N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


136 


136 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


176 


176 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


244 


246 




CELL ATTACHMENT SITE ( POTENTIAL) . 


FT 


CONFLICT 


279 


279 




G -> P (IN REF. 3) . 


FT 


CONFLICT 


286 


286 




F - > S (IN REF. 2) . 


FT 


CONFLICT 


309 


309 




K - > E (IN REF. 2) . 


FT 


CONFLICT 


322 


322 




C -> R (IN REF. 2) . 


FT 


CONFLICT 


350 


350 




A -> G (IN REF . 2) . 


SQ 


SEQUENCE 


390 AA; 44328 


MW; 


1539F849BA0C0FF1 CRC64 ; 



Query Match 86.6%; Score 1831; DB 1; Length 390; 

Best Local Similarity 86.7%; Pred. No. 8.4e-137; 

Matches 34 5; Conservative 15; Mismatches 30; Indels 8; Gaps 1; 
Qy 1 MAPSGLRLLPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

I II llllllllllllllll MllhlMMIIIIMIIIMIIIMIMIIIIMM 

Db 1 MPPSRLRLLPLLLPLLWLLVLAPGRPASGLSTCKTI DMELVKRKRI EAI RGQI LSKLRLA 60 

Qy 61 S P PSQGDVP PG PLPEAVLALYNSTRDR VAGES VE PE P E PEADYYAKE VTRVLMVESGNQI 12 0 

Ml II MM Mill II Ml MM 1 1 Ml Ml I Ml MM I III Mill MM- = I 

Db 61 SP PSQGDVP PGPL PEA VTiALYNSTRDRVAGESAEPEPEPEPDYYAKEVTRVLMVDNSHN I 12 0 



Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 



I : hi MINIMI Ihhlllllllh 1 1 1 1 II MM II II! I hill 

Db 121 YKS I ETVAHS I YMFFNTSELREAVPDPLLLSRAELRMQRLKLNVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFN 240 

MINI M h I M I M M M M I M h • I MM! Ml I Ml MM Mill 

Db 181 YLSNQLLTPSDTPEWLSFDWGVVRQWLSQGEELEGFRFSAHCSCDSKDNTLRVEINGIG 240 

Qy 241 SGRRGDIATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

MMM 1 1 II t M M I II 1 1 1 1 1 M II I M 1 1 II I MIMIMM Ml 

Db 241 PKRRGDLAAIHGMNRPFLLLMATPLERAQHLHSSRHRRGLDTNYCFSST EKN 292 

Qy 3 01 CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 360 

IIIIIMIIIIIIMIIIIIIMIIIIIIMIIIIIIIIIIMIIIIIIIIIIIIIIIII 

Db 293 CCVRQLYI DFRKDLGWKWI HEPKGYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASA 352 

Qy 361 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

1 1 1 ! 1 1 1 ! 1 1 1 M 1 1 It ! I ! I IMIIIIMIIMIII 

Db 353 APCCVPQALEPLP I VYYVGRKAKVEQLSNMI VRSCKCS 390 

RESULT 10 
TGF1__B0VIN 

ID TGF1_B0VIN STANDARD; PRT; 315 AA. 

AC P18341; 

DT 01-NOV-1990 (Rel, 16, Created) 

DT Ol-NOV-1990 (Rel . 16, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (Fragment) . 

GN TGFB1 . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla; Ruminant ia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N . A . 

RX MEDLINE=91042552; PubMed=3 1534 59 ; 

RA van Obberghen-Schilling E. , Kondaiah P., Ludwig R.L., Sporn M.B., 

RA Baker C. C. ; 

RT "Complementary deoxyribonucleic acid cloning of bovine transforming 

RT growth factor-beta 1 . " ; 

RL Mol. Endocrinol. 1:693-698(1987). 

RN [2] 

RP SUBUNITS. 

RC TISSUE=Bone; 

RX MEDLINE=92129307; PubMed=1733 936 ; 

RA Ogawa Y. , Schmidt D.K., Dasch J.R., Chang R.J., Glaser C.B.; 

RT "Purification and characterization of transforminq qrowth factor-b^ta 

RT 2.3 and -beta 1.2 heterodimers from bovine bone."; 

RL J. Biol. Chem. 267:2325-2328(1992). 

CC -!- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES . MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

CC ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-linked . Heterodimers of TGF-beta 1/2 



CC have been found in bone. 

CC -!- SUBCELLULAR LOCATION : Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Biomf ormat ics and the EMBL outstation - 

CC the European Bioinf ormat ics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to 1 icense@isb-sib . ch) . 

CC 

DR EMBL; M36271; AAA30778.1; -. 

DR PIR; A40057; A40057. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; I PRO 03 911; TGF_TGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR Pfarn; PF00019; TGF-beta; 1. 

DR Pfarn; PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR00438 ; GFCYSKNOT. 

DR PRINTS; PRO 1423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 



DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1. 


KW 


Growth factor; Mitogen; 


Glycoprotein. 


FT 


N0N_TER 


1 


1 






FT 


PROPEP 


<1 


203 






FT 


CHAIN 


204 


315 




TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


210 


219 




BY SIMILARITY. 


FT 


DISULFID 


218 


281 




BY SIMILARITY. 


FT 


DISULFID 


247 


312 




BY SIMILARITY. 


FT 


DISULFID 


251 


314 




BY SIMILARITY. 


FT 


DISULFID 


280 


280 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


7 


7 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


61 


61 




N- LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


101 


101 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


169 


171 




CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


315 AA; 


36269 MW; 


C2717A23D994E00E CRC64 ; 


Query Match 




74 


.6%; 


Score 1577; DB 1; Length 315; 


Best Local Similarity 


91 


.3%; 


Pred. No. 6.3e-117 ; 



Matches 295; Conservative 9; Mismatches 11; Indels 8; Gaps 1; 
Qy 76 AVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLF 135 

hlllllllllllllll I MM Ml I MINI MINI I hill I I : MM I I 

Db 1 AIIALYNSTRDRVAGESAETEPEPEADYYAKEVTRVLMVEYONKT YDKMKSSSHS I YMFF 60 

Qy 136 NTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEW 195 

I I I I I I I I I I I I I I I I I h : I I I I I I I I I I I I I I I I I I I I I = I I I I I | | I I I I I | | | | I I 
Db 61 NTSELREAVPEPVLLSRADVRLLRLJO.KVEQHVELYQKYSNNSWRYLSNRLLAPSDSPEW 120 

Qy 196 LS FDVTG WRQWLTRREA I EGFRLSAHSS S DS KDNTLHVE I NGFNSGRRGDLAT I HGMNR 255 

I ! 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 MIMIIII I IMIMI I : 1 1 1 1 : 1 1 1 [ M I I II I I I I I 

Db 121 LSFDVTGWRQWLTRREEIEGFRLSAHCSCDSKDNTLQVDINGFSSGRRGDLATIHGMNR 180 



Qy 2 56 PFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLG 315 

M 1 1 M I M I II M M M 1 1 II 1 1 M I II M 1 1 1 I II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 

Db 181 PFLLLMATPLERAQHLHSSRHRRALDTNYCFSST EKNCCVRQLYIDFRKDLG 232 

Qy 316 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 3 75 

IIIIIIIIIIMIIIIIIIIIIIIIIIIIMMIIIIIIIIIIIIIIIIIIIIIIMIII 

Db 233 WKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIV 292 

Qy 376 YYVGRKPKVEQLSNMIVRSCKCS 398 

1 : 1 1 1 1 1 1 1 ii 1 1 :i 1 1 ! : i ii i 

Db 2 93 YYVGRKPKVEQLSNMIVRSCKCS 315 

RESULT 11 
TGF1_CHICK 

ID TGF1_CHICK STANDARD; PRT; 373 AA. 

AC P09531; 

DT 01-MAR-1989 (Rel. 10, Created) 

DT 01-OCT-1996 (Rel. 34, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 4) 

DE (Fragment) . 

GN TGFB1 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves ; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBIJTaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=89112198; PubMed=24 64 13 1 ; 

RA Jakowlew S.B., Dillard P.J., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a messenger 

RT ribonucleic acid encoding transforming growth factor beta 4 from 

RT chicken embryo chondrocytes."; 

RL Mol. Endocrinol. 2:1186-1195(1988). 

RN [2] 

RP REVISIONS. 

RX MEDLINE=92357039; PubMed=1353860 ; 

RA Burt D.W., Jakowlew S.B.; 

RT "Correction: a new interpretation of a chicken transforming growth 

RT factor-beta 4 complementary DNA . " ; 

RL Mol. Endocrinol. 6:989-992(1992). 

CC -I- FUNCTION: TGF-BETA IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROL 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA AND ESSENTIALLY ALL OF THEM 

CC HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA REGULATES THE 

Cu ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND DETERMINES 

CC A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: Homodimer; disul fide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 



cc 


use by 


non-profit institutions as long as its content is in no way 


cc 


modified 


and this 


statement is not removed. Usage by and for commercial 


cc 


entities 


requires 


a license agreement (See http://www.isb-sib.ch/announce/ 


cc 
cc 

DR 


or send an email 


to license@isb-sib. ch) . 


EMBL; M31160; AAB05637.1; -. 


DR 


PIR; A41918; A41918. 


DR 


HSSP; P01137; 1KLA. 


DR 


InterPro; 


IPR0039 


11; TGF TGFb. 


DR 


InterPro; 


IPR0018 


39; TGFb. 


DR 


InterPro ; 


IPR001111; TGFb N. 


DR 


Pfam; PF00019; TGF-beta; 1. 


DR 


Pfam; PF00688; TGFbjpropept ide ; 1. 


DR 


PRINTS; PR01423; 


TGF BETA . 


DR 


ProDom; PD000357; 


TGFb; 1. 


DR 


SMART ; SM00204; TGFB; 1. 


DR 


PROSITE; 


PS00250; 


TGF BETA 1; 1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


NON TER 


1 


1 


FT 


SIGNAL 


<1 


1 POTENTIAL. 


FT 


PROPEP 


2 


25 9 POTENTIAL. 


FT 


CHAIN 


260 


373 TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


266 


277 BY SIMILARITY. 


FT 


DISULFID 


276 


339 BY SIMILARITY. 


FT 


DISULFID 


305 


370 BY SIMILARITY. 


FT 


DISULFID 


309 


372 BY SIMILARITY. 


FT 


DISULFID 


338 


338 INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


54 


54 N -LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


109 


109 N- LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


153 


153 N- LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


SITE 


224 


226 CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


373 AA; 42634 MW; 9903F3479C8552E5 CRC64 ; 



Query Match 50.3%; Score 1063.5; DB 1; Length 373; 

Best Local Similarity 54.9%; Pred. No. 2.1e-76; 

Matches 209; Conservative 51; Mismatches 100; Indels 21; Gaps 8; 



Qy 30 LSTCKTIDMELVKRKRI EAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYNSTRDRVA 89 

1111= MM hllllhllllllllll =11 = II llh I MINI- : 
Db 2 LSTCQRLDLEAAKKKRI EAVRGQILSKLRLTAPPPASETPPRPLPDDVRALYNSTQELLK 61 

Qy 90 GES-VEPEPEPEADYYAKEVTRVLMVESGNQIYDKFKGTPHSLYMLFNTSELREAVPEPV 148 

: : | |: :|:|||: |: | : : : :: ||:: :|| | | | 
Db 62 QRARLRPPPDGPDEYWAKELRRIPMETTWDGAMEHWQPQSHSI FFVFNVSRARRG-GRPT 12 0 

Qy 14 9 LLSRAELRLLRLKLK VEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTGV 2 03 

II llllhll I II UNI I I Mill I : : llllllll 

Db 121 LLHRAELRMLRQKAAADSAGTEQRLELYQGYGNASWRYLHGRSVRATADDEWLSFDVTDA 1RO 

Qy 204 VRQWLTRREAIEGFRLSAHSSSD SKDNTLHVEINGFNSGRRGDLATIHGMNR- -PFL 258 

I llh I : hi I I ■■•.III :|lh :| '-I I-' 

Db 181 VHQWLSGSELLGVFKLSVHCPCEMGPGHAEEMRISIEGFEQ-QRGDMQSIAKKHRRVPYV 239 

Qy 2 59 LLMATPLERAQHLHSSRHRRALDTNYCFS-STDYKDDDDKEKNCCVRQLYIDFRKDLGWK 317 

I II MM llhl II llhlll II Illllll lllllllll II 

Db 240 LAMALPAERANELHSARRRRDLDTDYCFG PGTD EKNCCVRPLYI DFRKDLQWK 292 



Qy 318 WIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYY 377 

MINI MINIMUM I M h M M I M I M 1 1 M 1 1 M M M hlllhll 

Db 2 93 W I HEPKGYMANFCMGPCPYI WSADTQYTKVLALYNQHNPGASAAPCCVPQTLDPLP 1 1 YY 352 

Qy 378 VGRKPKVEQLSNMI VRSCKCS 398 

Ml MIIMIhlhMM 

Db 3 53 VGRNVRVEQLSNMWRACKCS 373 



RESULT 12 
TGF1_XENLA 

ID TGF1__XENLA STANDARD / PRT; 382 AA. 

AC P16176; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 1 precursor (TGF-beta 1) (TGF-beta 5) . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ,- 

OC Amphibia; Batrachia; Anura; Mesobatrachia ; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90110090; PubMed=2295601 ; 

RA Kondaiah P., Sands M.J., Smith J.M., Fields A., Roberts A.B., 

RA Sporn M.B., Melton D.A.; 

RT "Identification of a novel transforming growth factor-beta (TGF-beta 

RT 5) mRNA in Xenopus laevis. "; 

RL J. Biol. Chem. 265:1089-1093(1990). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Vempati U.D., Kondaiah P.; 

RL Submitted (AUG-1997) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: IMPORTANT ROLE IN CERTAIN ASPECTS OF DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulfide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; J05180; AAA49968.1; -. 

DK KMBL; AF009335; AAB64441.1; -. 

DR EMBL; AF009331; AAB64441.1; JOINED . 

DR EMBL; AF009332; AAB64441.1; JOINED. 

DR EMBL; AF009333; AAB64441.1; JOINED. 

DR EMBL; AF009334; AAB64441.1; JOINED. 

DR PIR; A34929; B61036. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR0024 00; GF_cysknot . 

DR InterPro; IPR003911; TGFjTGFb. 



DR 


InterPro; 


IPR001839; TGFb . 




DR 


Inter Pro ; 


IPR001111; TGFb N. 




DR 


Pfam; PF00019; TGF-beta; 1. 




DR 


Pfam; PF00688; TGFb_propeptide ; 1. 


DR 


PRINTS; PR00438; 


GFCYSKNOT. 




DR 


PRINTS; PR01423; 


TGFBETA . 




DR 


ProDom; PD0003 57; 


TGFb; 1. 




DR 


SMART ; SM002 04; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


21 


POTENTIAL. 


FT 


PROPEP 


22 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA 1. 


FT 


DISULFID 


277 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DISULFID 


314 


379 


BY SIMILARITY . 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


73 


73 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


123 


123 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


166 


166 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


SITE 


234 


236 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


382 AA 


; 44200 MW; 


1034621C917AAE15 CRC64 ; 


Query Match 




41.3%; 


Score 874; DB 1; Length 382; 



Best Local Similarity 46.9%; Pred. No. 1.8e-61; 

Matches 190; Conservative 55; Mismatches 122; Indels 38; Gaps 11; 

Qy 9 LPLLLPLLWLLVLTPGRPAAGLSTCKTI DMELVPCRKRI EAI RGQI LSKLRLASPPSQGDV 68 

: M II MM I I I I I I ' I I I I I I I I I II I I I I I I • I I I 

Db 1 MEVTjWMLLVliLvljHLSSIJUyl^ 59 

Qy 6 9 PPGPLPEAVLALYNSTRDRVAGESVEPE PE PEADY YAKEVTR VLMVESGNQ I YDK 123 

: I I I I I I I = MM hi I M :: I 

Db 60 EKMTVPSEAIFLYNSTLEVIREKATREEEHVGHDQNIQDYYAKQVYR FESITELED- 115 

Qy 124 FKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKL- - KVEQHVELYQKYSNDSW- - 17 9 

I M I Ml I M MM: : : ::| :||: || : 

Db 116 HEFKFKFNASHVRENVGMNSLLHHAELRMYKKQTDKNMDQRMELFWKYQENGTTH 17 0 

Qy 18 0 -RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDNTLHVE 23 5 

III = I M M M I I I Ml I I III : Ml 
Db 171 SRYLESKYITPVTDDEWMSFDVTKTVNEWLKRAEENEQFGLQPACKCPTPQAKD ID 226 

Qy 236 I NGFNSGRRGDLATI HGM - - NRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKD 293 

III Mill- MM- : I II : ||| :| : ||| : 
Db 227 I EGF- PALRGDLASLSSKENTKPYLMITSMPAERI DTVTSSRKKRGVGQEYCFGNNG 282 

Qy 2 94 DDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQ 3 53 

Mlh 1 1 1 : 1 1 i , I ) 1 1 ( 1 1 1 ) 1 1 M M II II 1 1 II M 1 1 1 II 1 1 M M I 

Db 283 PNCCVKPLYINFRKDLGWKWIHEPKGYEANYCLGNCPYIWSMDTQYSKVLSLYNQ 337 

Qy 3 54 HNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

MUM M MM MM MM III I MM MUM III II 

Db 338 NNPGASISPCCVPDVLEPLPIIYYVGRTAKVEQLSNMWRSCNCS 382 



RESULT 13 
TGF3_CHICK 

ID TGF3__CHICK STANDARD; PRT; 412 AA. 

AC P16047; 

DT Ol-APR-1990 (Rel . 14, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus, 

OX NCB I _Tax I D = 9 0 3 1 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=89096966; PubMed=32 1 1 158 ; 

RA Jakowlew S.B., Dillard P.J., Kondaiah P., Sporn M.B., Roberts A.B.; 

RT "Complementary deoxyribonucleic acid cloning of a novel transforming 

RT growth factor-beta messenger ribonucleic acid from chick embryo 

RT chondrocytes . " ; 

RL Mol. Endocrinol. 2:747-755(1988). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=White leghorn; 

RX MEDLINE=95169270; PubMed=78 65 12 9 ; 

RA Burt D.W., DeyB.R., Patonl.R., Morrice D.R. , Law A.S.; 

RT "The chicken transforming growth factor-beta 3 gene: genomic 

RT structure, transcriptional analysis, and chromosomal location."; 

RL DNA Cell Biol. 14:111-123(1995). 

RN [3] 

RP SEQUENCE OF 1-117 FROM N.A. 

RC STRAIN=White leghorn; TISSUE=Blood; 

RX MEDLINE=92134496; PubMed=184 0616 ; 

RA Burt D.W., Dey B.R., Paton I.R.; 

RT "Comparative analysis of human and chicken transforming growth 

RT factor-beta 2 and -beta 3 promoters."; 

RL J. Mol. Endocrinol. 7:175-183(1991). 

RN [4] 

RP SEQUENCE OF 1-117 FROM N.A. 

RX MEDLINE=93024487; PubMed= 14 06706 ; 

RA Jakowlew S.B., Lechleider R. , Geiser A.G., Kim S.J., 

RA Santa-Coloma T.A., Cubert J., Sporn M.B., Roberts A.B.; 

RT "Identification and characterization of the chicken transforming 

RT growth factor-beta 3 promoter."; 

RL Mol. Endocrinol. 6:1285-1298(1992). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENES I S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disul fide- 1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



CC or send an email to 1 icense@isb-sib . ch) . 



CC 
DR 


EMBL; M31154; AAA49089.1, 




DR 


EMBL; X58127; CAA41128.2 




DR 


EMBL; X60055; CAA41128.2, 


JOINED . 


DR 


EMBL; X60091; CAA41128.2, 


JOINED. 


DR 


EMBL; X60090; CAA41128.2, 


JOINED. 


DR 


EMBL; S46000; AAB23575.1, 




DR 


PIR; A34939; A34939. 




DR 


HSSP; P10600; 1TGJ . 




DR 


InterPro; 


IPR0024 00; GF cysknot . 


DR 


InterPro; 


IPR003911; TGF 


TGFb. 


DR 


InterPro; 


IPR001839; TGFb . 


DR 


InterPro; 


IPR001111; TGFb N. 


DR 


Pfam; PF00019; TGF-beta; 


1 . 


DR 


Pfarn; PF00688; TGFbjpropept ide ; 1. 


DR 


PRINTS; PRO 043 8; GFCYSKNOT . 


DR 


PRINTS; PR01423; TGFBETA 




DR 


ProDotti; PD000357; TGFb; 1. 


DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; PS00250; TGF_BETA_1 ; 1. 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 23 


POTENTIAL . 


FT 


PROPEP 


24 300 




FT 


CHAIN 


301 412 


TRANSFORMING GROWTH FACTOR BETA 3. 


FT 


DISULFID 


307 316 


BY SIMILARITY. 


FT 


DISULFID 


315 378 


BY SIMILARITY. 


FT 


DISULFID 


344 409 


BY SIMILARITY. 


FT 


DISULFID 


348 411 


BY SIMILARITY. 


FT 


DISULFID 


377 377 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


74 74 


N-LINKED ( GLCNAC . . .) (POTENTIAL) 


FT 


CARBOHYD 


135 135 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


142 142 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


261 263 


CELL ATTACHMENT SITE (POTENTIAL) . 


FT 


CONFLICT 


323 326 


ELPT ~> DFRQ (IN REF . 1) . 


SQ 


SEQUENCE 


412 AA; 47077 MW; 1CAB883 170069D55 CRC64; 


Query Match 


40.5%; Score 857; DB 1; Length 412; 


Best Local Similarity 46.4%; Pred. No. 4.4e-60; 


Matches 195; 


Conservative 


56; Mismatches 117; Indels 52; 



Qy 15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI E1AI RGQI LSKLRLASPPSQGDVPPGPLP 74 

h I = h : Ihl hhl :hl hi Mill III III I III II =| 

Db 9 LVLLSLLSFATVSLALSSCTTLDLEHI KKKRVEAI RGQI LSKLRLTSPPE - - SVGPAHVP 66 

Qy 75 EAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVE SGNQI YDKF 124 

= 1 I II III h = I =1 I I h:||llh = I- | = = 

Db 67 YQILALYNSTRELL- -EEMEEEKEESCSQENTESEYYAKEIHKFDMIQGTiPKHNELGI CP 124 

Qy 125 KGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND 177 

II — Ml I I III h I h : II Hhl I 

Db 125 KGVTSNVFR - FNVS SAEKNSTNLFRAEFRVLRVPNPSSKRSEQRI ELFQILRPDEHI 180 

Qy 178 -SWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKD 229 

I I I I I = IIIIIIMM Ihll Ih I =1 I : I : 

Db 181 AKQRYLSGRNVQTRGSPEWLSFDVTDTVREWLLHRESNLGLEI S I HCPCHTFQPNGDI LE 24 0 



Qy 230 N TLHVE I NGFNSG RRGDLATI HGMNRPFLLLMATPLERAQH- -LHSSRHRR 278 

I I - I U II I I • - I h I I I I = I I = I 

Db 241 NLHEVLEI KFKGI DSEDDYGRGDLGRLKKQKDLHNPHLILMMLPPHRLESPTLGGQRKKR 300 

Qy 279 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 33 8 

IIMIIM = hlllll |||: MlllhllllM I I I I Mill: 

Db 301 ALDTNYCFRNL, EENCCVRPLYIELPTDLGWKWVHEPKGYFANFCSGPCPYLR 352 

Qy 33 9 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 98 

I II MM III II IMMIIIM MM hlllll I MM I llh IH MM 

Db 3 53 SADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 

RESULT 14 
TGF3_PIG 

ID TGF3_PIG STANDARD; PRT ; 409 AA . 

AC P15203; 

DT 01-APR-1990 (Rel . 14, Created) 

DT 01-APR-1990 (Rel. 14, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Sus scrofa (Pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetart iodactyla ; Suina; Suidae; Sus. 

OX NCBI_TaxID=9823; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Ovary; 

RX MEDLINE=89091120; PubMed=32 08746 ; 

RA Derynck R. , Lindquist B., Lee A. , Wen D. , Tamm J., Graycar J.L., 

RA RheeL., Mason A. J., Miller D. A. , Coffey R. J., Moses H.L., 

RA Chen E . Y . ; 

RT "A new type of transforming growth factor-beta, TGF-beta 3 . 11 ; 

RL EMBO J. 7:3737-3743(1988). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESI S AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulf ide-linked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY; Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormat ics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR fcMBL; X14150; CAA32363.1; -. 

DR PIR; S01825; S01825. 

DR HSSP; P10600; 1TGJ . 

DR InterPro; I PRO 024 00; GF_cysknot. 

DR Inter Pro; IPR003 911; TGFJTGFb. 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR00438; 


GFCYSKNOT . 




DR 


PRINTS; 


PR01423; 


TGFB ETA . 




DR 


ProDom; 


PD000357; 


TGFb ; 


1 . 




DR 


SMART; SM002 04; TGFB ; 1. 






DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; 


Glycoprotein; Signal . 


FT 


SIGNAL 


1 


18 




POTENTIAL. 


FT 


PROPEP 


19 


297 






FT 


CHAIN 


298 


409 




TRANSFORMING GROWTH FACTOR BETA 3 . 


FT 


DISULFID 


304 


313 




BY SIMILARITY. 


FT 


DISULFID 


312 


375 




BY SIMILARITY. 


FT 


DISULFID 


341 


406 




BY SIMILARITY. 


FT 


DISULFID 


345 


408 




BY SIMILARITY. 


FT 


DISULFID 


374 


374 




INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


72 


72 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


133 


133 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


140 


140 




N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


259 


261 




CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


409 AA; 46814 MW ; 


B4900235B5CC955E CRC64 ; 


Query Match 




39. 


7%; 


Score 83 8.5; DB 1; Length 4 09; 



Best Local Similarity 45.6%; Pred. No. 1.3e-58; 

Matches 190; Conservative 58; Mismatches 122; Indels 47; Gaps 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I =1 = =111 hi = ^ I ' I M ! ' I I ' I I I M I ! Ill : :| 

Db 7 LVVXALLNFATVSLSMSTCTTLDFDHI KRKRVEAI RGQI LSKLRLTSPPDPSML- -ANI P 64 

Qy 75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMV ESGNQI YDKFKG 126 

II llllll = I II : : h:|lllh I = I I = II 

Db 65 TQVLDLYNSTRELLEEVHGERGDDCTQENTESEYYAKEIYKFDMIQGLEEHNDLAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

-III:: I Ml |:||: = M Hhl I 

Db 125 ITSKIFR-FNVSSVEK NETNLFRAEFRVLRMPNPSSKRSEQRIELFQILQPDEHIAK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 

Ih : I : 1 I I I I I I I lh|| llh I :| I = I =1 

Db 181 QRY I DGKNLPTRGAAEWLSFDVTDTVREWLLRRESNLGLEI S I HCPCHTFQPNGDI LENI 240 

Qy 231 - -TLHVEINGFNS GRRGDLATIHGM- -NRPFLLLMATPLERAQH- -LHSSRHRRALD 281 

: I : I III! : : I hll I • I : I : I :|||| 

Db 241 QEVMEI KFKGVDSEDDPGRGDLGRLKKKKEHSPHLI LMMI PPDRLDNPGLGAQRKKRALD 300 

Qy 282 TNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLD 341 

Mill : hlllll MlllhlllllhlllllhMM MMh I I 

Db 301 TNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSAD 352 

Qy 342 TQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 

I M II III II llh Ml III MM hlllll Mill II hhl MM 

Db 353 TTHS S VLGLYNTLN P EASAS PCCVPQDLE PLT I LYYVGRTAKVEQLSNMWKSCKCS 409 



RESULT 15 
TGF3_MOUSE 

ID TGF3_MOUSE STANDARD; PRT; 410 AA. 

AC P17125; 



DT 01-AUG-1990 (Rel . 15, Created) 

DT 01-AUG-1990 (Rel. 15, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Transforming growth factor beta 3 precursor (TGF-beta 3) . 

GN TGFB3 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata,- Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX N CB I _Ta xID = 10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90190650; PubMed=2 628 73 0 ; 

RA Miller D.A. , Lee A., Matsui Y., Chen E.Y., Moses H.L., Derynck R.; 

RT "Complementary DNA cloning of the murine transforming growth 

RT factor-beta 3 (TGF beta 3) precursor and the comparative expression 

RT of TGF beta 3 and TGF beta 1 messenger RNA in murine embryos and 

RT adult tissues."; 

RL Mol. Endocrinol. 3:1926-1934(1989). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91000714; PubMed=22 06556 ; 

RA Denhez F . , Lafyatis R., Kondaiah P., Roberts A.B., Sporn M.B.; 

RT "Cloning by polymerase chain reaction of a new mouse TGF-beta, 

RT mTGF-beta 3 . " ; 

RL Growth Factors 3:139-146(1990). 

CC -!- FUNCTION: INVOLVED IN EMBRYOGENESIS AND CELL DIFFERENTIATION. 

CC -!- SUBUNIT: Homodimer; disulfide-1 inked . 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- SIMILARITY: Belongs to the TGF-beta family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

cc 

DR EMBL; M32745; AAA40422.1; 

DR PIR; A41397; A41397. 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI : 98727; Tgf b3 . 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPR003 911; TGF__TGFb . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS0025 0; TGF_BETA_1; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 

FT SIGNAL 1 23 POTENTIAL. 

FT PROPEP 24 298 POTENTIAL. 

FT CHAIN 299 410 TRANSFORMING GROWTH FACTOR BETA 3. 



FT DISULFID 


305 


314 


BY SIMILARITY. 


FT DISULFID 


313 


376 


BY SIMILARITY 


FT DISULFID 


342 


407 


BY SIMILARITY. 


FT DISULFID 


346 


409 


BY SIMILARITY. 


FT DISULFID 


375 


375 


INTERCHAIN (BY SIMILARITY) . 


FT CARBOHYD 


72 


72 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT CARBOHYD 


133 


133 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


FT CARBOHYD 


140 


140 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT SITE 


259 


261 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ SEQUENCE 


410 AA; 


46884 MW; 


250F7048CA432BD6 CRC64 ; 


Query Match 




39.4%; 


Score 832; DB 1; Length 410; 


Best Local Similarity 


45.5%; 


Pred. No. 4.1e-58; 


Matches 190; 


Conservative 58 


Mismatches 122; Indels 48; 



IPS 14; 

Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I :| : I I I I hi :|:|hlMIIIMIIIII III = :| 

Db 7 LVVLALLNLATISLSLSTCTTLDFGHIKKKRVEAIRGQILSKLRLTSPPEPSVMT--HVP 64 

Qy 75 EAVLALYNSTR DR VAGESVE - - PEPEPEADYYAKEVTRVLMVE SGNQIYDKFKG 126 

IIIIIIMI = = II I : |::||llh = l = = M = II 

Db 65 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 124 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: Ml : : I III hlh = II =11 = 1 I 

Db 125 I TSKVFR- FNVSSVEK NGTNLFRAEFR VLRVPNPSSKRTEQRI ELFQI LRPDEHI AK 180 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 23 0 

I h : I : llllllll 11 = 11 llh I : I I : | :| 

Db 181 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 240 

Qy 231 - - TLHVE I NGFNS GRRGDLAT I HGM NRPFLLLMATPLER- -AQHLHSSRHRRAL 280 

: - I - I I I I = = I 1 = 11 11= II =111 

Db 241 HEVMEI KFKGVDNEDDHGRGDLGRLKKQKDHHNPHLI LMMI PPHRLDSPGQGSQRKKRAL 3 00 

Qy 281 DTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGVJKWIHEPKGYHANFCLGPCPYIWSL 34 0 

MINI = 1 = 11111 I II I I I = I I I I I I = I I I I I I = I II I llllh I 

Db 301 DTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSA 352 

Qy 341 DTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGRKPKVEQLSNMI VRSCKCS 3 98 

II =1 II Ml II llhllllll MM 1 = 11111 IIMIIIIhhIIIII 

Db 3 53 DTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 410 



Search completed: October 28, 2003, 09:08:39 
Job time : 8.97725 sees 

GenCore version 5.1.6 
Copyright (c) 1993 - 2003 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



October 28, 2003, 07:50:55 ; Search time 31.909 Seconds 

(without alignments) 
3218.683 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-017-372E-13 
2114 

1 MAPSGLRLLPLLLPLLWLLV 



GRKPKVEQLSNMI VRSCKCS 3 98 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 830525 seqs, 258052604 residues 

Total number of hits satisfying chosen parameters: 830525 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : SPTREMBL_23 : * 

1 : sp_archea : * 

2: sp_bacteria : * 

3: sp_fungi:* 

4 : sp_human : * 

5 : sp_invertebrate : * 

6 : sp_mammal : * 

7 : sp_mhc : * 

8: sp_organelle : * 

9 : sp_phage : * 

10: sp_plant:* 

11: sp_rodent:* 

12 : sp_virus : * 

13 : sp_vertebrate : * 

14 : sp_unclassif ied; * 

15: sp_rvirus:* 

16 : sp__bacteriap : * 

17: sp_archeap:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1820 


86 


1 


390 


6 


Q9TUM8 


Q9tum8 equus cabal 


2 


1741 


82 


4 


368 


11 


Q8R4D9 


Q8r4d9 sigmorion hi 


J 


848 


40 


1 


379 


13 


Q8JHF5 


Q8jhf5 sparus aura 


4 


844 


39 


9 


379 


13 


Q8AXK8 


Q8axk8 sparus aura 


5 


832 


39 


4 


412 


11 


Q91YU7 


Q91yu7 mus musculu 


6 


825 . 5 


39 


0 


382 


13 


Q9PWA9 


Q9pwa9 morone chry 


7 


823 .5 


39 


0 


382 


13 


093449 


0934 4 9 oncorhynchu 


8 


818 .5 


38 


7 


414 


11 


Q91VP5 


Q91vp5 mus musculu 


9 


788 . 5 


37 


3 


376 


13 


Q9PTQ2 


Q9ptq2 cyprinus ca 


10 


757.5 


35 


8 


399 


11 


Q9ERB7 


Q9erb7 mesocricetu 


11 


724 


34 


2 


362 


11 


Q99K17 


Q99kl7 mus musculu 



12 


696 


32 . 


9 


130 


11 


Q08714 


Q08 714 mesocricetu 


13 


679 


32 . 


. 1 


124 


6 


Q95N80 


Q95n80 canis famil 


14 


677 


32 . 


0 


361 


13 


Q98854 


Q98854 cyprinus ca 


15 


624 


29 . 


5 


112 


6 


002730 


00273 0 oryctolagus 


16 


597 . 5 


28 . 


3 


255 


11 


Q921T1 


Q921tl mus musculu 


17 


589 


27 . 


. 9 


127 


6 


Q9TV08 


Q9tv08 canis famil 


18 


567 . 5 


26 . 


. 8 


224 


11 


Q8CDZ9 


Q8cdz9 mus musculu 


19 


563 


26 . 


. 6 


200 


13 


Q90YF1 


Q90yfl pleuronecte 


20 


540 


25 . 


. 5 


101 


11 


Q9R184 


Q9rl84 meriones un 


21 


453 . 5 


21 . 


. 5 


179 


13 


Q90YF2 


Q90yf2 pleuronecte 


22 


413 


19 . 


, 5 


88 


13 


Q90YF5 


Q90yf5 pleuronecte 


23 


402 


19 . 


0 


88 


13 


Q90YF7 


Q90yf7 oncorhynchu 


24 


397 


18 . 


. 8 


88 


13 


Q90ZE7 


Q90ze7 acipenser b 


25 


393 


18 . 


6 


87 


13 


042306 


042306 carassius a 


26 


387 


18 . 


. 3 


309 


4 


Q8WV8 8 


Q8wv8 8 homo sapien 


27 


382 


18 . 


. 1 


91 


6 


Q9MYZ1 


Q9myzl capra hircu 


28 


373 


17 . 


. 6 


86 


6 


Q28241 


Q28241 cervus elap 


29 


358 


16 . 


, 9 


81 


6 


Q9N1S3 


Q9nls3 capreolus c 


30 


335 


15 . 


. 8 


375 


13 


Q8UWD8 


Q8uwd8 columba liv 


31 


328 


15 . 


. 5 


389 


13 


Q90YY0 


Q90yy0 ictalurus p 


32 


322 


15 . 


. 2 


375 


13 


Q8AVB2 


Q8avb2 coturnix co 


33 


320 


15 . 


. 1 


87 


13 


Q8JHB6 


Q8jhb6 scophthalmu 


34 


320 


15 


. 1 


375 


13 


Q8UWD7 


Q8uwd7 coturnix ch 


35 


317 


15 . 


. 0 


77 


13 


Q90YF8 


Q90yf8 oncorhynchu 


36 


315 


14 


. 9 


375 


13 


Q98SP0 


Q98sp0 gallus gall 


37 


312 


14 . 


. 8 


375 


6 


Q9GM97 


Q9gm97 equus cabal 


38 


312 


14 . 


, 8 


375 


13 


Q8UWE0 


Q8uwe0 anas platyr 


39 


310 


14 


.7 


375 


13 


Q8UWD9 


Q8uwd9 anser anser 


40 


308 


14, 


. 6 


375 


6 


Q8HY52 


Q8hy52 lepus capen 


41 


305 


14 


.4 


375 


6 


Q95J86 


Q95j86 macaca fasc 


42 


301 . 5 


14 


.3 


376 


13 


Q98TB4 


Q98tb4 oreochromis 


43 


301 


14 


.2 


62 


13 


Q90ZJ7 


Q90zj7 anguilla an 


44 


300.5 


14 . 


.2 


376 


13 


Q90W06 


Q90w06 umbrina cir 


45 


300 


14 


.2 


62 


13 


Q90YF4 


Q90yf4 pleuronecte 



ALIGNMENTS 



RESULT 1 
Q9TUM8 

ID Q9TUM8 PRELIMINARY; PRT ; 390 AA. 

AC Q9TUM8 ; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

GN TGFB1 . 

US Equus caballus (Horse) . 

OC Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Perissodactyla ; Equidae; Equus. 

OX NCBI_TaxID=9796; 

RN [ 1 ] 

RP SEQUENCE FROM N.A. 

RA Nixon A.J., Brower-Toland B.T., Sandell L.J.; 

RT "Molecular cloning of equine transforming growth factor beta 1 reveals 

RT a divergent nucleotide structure that encodes a novel bioactive 



RT peptide among mammalian species. "; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

CC SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF175709; AAD49431.1; -. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; I PRO 03911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 390 AA; 43860 MW; 22 0FE4 0DFCCA6016 CRC64 ; 

Query Match 86.1%; Score 1820; DB 6; Length 390; 

Best Local Similarity 87.4%; Pred. No. 8.8e-154; 

Matches 348; Conservative 12; Mismatches 30; Indels 8; Gaps 1; 

Qy 1 MA P SGLRLL P LLL PLLW LLVLT PGR P AAGL S T C KT I DM ELVKR KRIEAIRGQILS KLRLA 60 

I llllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 1 MPPSGLRLLPLLLPLLWLLVLT PGR PAAGLSTCKT I DMELVKR KRIEAIRGQILS KLRLA 60 

Qy 61 SPPSQGDVPPGPLPEAVLALYNSTRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI 12 0 

I MM 1 = 1 MM III I MINI III : MM I MM Ml Ml Mill III Ml IM 

Db 61 SPPSQGEVPPGPLPEAVLALYNSTRAQVAGESAETEPEPEADYYAKEVTRVLMVEKENEI 120 

Qy 121 YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWR 18 0 

I = Ihll II MM 1 1 h h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 llllilllllllhlll 

Db 121 YKTVETGSHS I YMFFNASELRAAVPDPMLLSRAELRLLRLKLSVEQHVELYQKYSNNSWR 18 0 

Qy 181 YLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAI EGFRLSAHSSSDSKDNTLHVEINGFN 24 0 

lllllll 1 1 1 1 1 i 1 1 ! I ! ! 1 1 1 1 ! 1 1 ! : hll Mill lllllll I lllh 

Db 181 YLSNRLLTPSDSPEWLSFDVTGWRQWLSQGGAMEGLRLSAHCPCDSKDNTLRVGINGFS 24 0 

Qy 241 SGRRGDLATIHGMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKN 3 00 

I MMMM 1 1 1 1 1 1 [ 1 1 1 M I M M I II I II I II M M M I Ml Ml 

Db 241 SSRRGDLATIDGMNRPFLLLMATPLERAQQLHSSRHRRALDTNYCSSST EKN 292 

Qy 3 01 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 360 

I M I i 1 1 i I ) 1 1 1 1 1 1 1 1 1 1 1 1 1 ) I i 1 1 i 1 1 1 1 1 1 1 1 1 i 1 1 1 ] I ] 1 1 ) 1 1 1 1 ) 1 1 M 1 1 1 

Db 2 93 CCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASA 352 

Qy 3 61 APCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

IIIIIM 1 1 II II I II I II II II 1 1 1 M II I M I M I 

Db 353 APCCVPQVLEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 390 



RESULT 2 
Q8R4D9 

ID Q8R4D9 PRELIMINARY; PRT ; 368 AA. 

AC Q8R4D9; 

DT 01-JUN-2002 (TrEMBLrel . 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 



DT 01-MAR-2003 (TrEMBLrel . 23, Last annotation update) 

DE Transforming growth factor beta-1 protein (Fragment) . 

GN TGFB1 . 

OS Sigmodon hispidus (Hispid cotton rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Sigmodontinae; 

OC Sigmodon. 

OX NCBI__TaxID=424 1 5 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Blanco J.C., Pletneva L.M., Prince G.A. ; 

RT "Cotton rat cytokines, chemokines, and interferons."; 

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF480858; AAL87199.1; 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; IPR001839; TGFb. 

DR InterPro; I PRO 01111; TGFb_N . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb__propept ide ; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

FT NON_TER 1 1 

SQ SEQUENCE 368 AA; 41905 MW; A5C912 07B0468B4A CRC64 ; 

Query Match 82.4%; Score 1741; DB 11; Length 368; 

Best Local Similarity 86.4%; Pred. No. 9e-147; 

Matches 325; Conservative 16; Mismatches 27; Indels 8; Gaps 1; 
Qy 23 PGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLPEAVLALYN 82 

1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 II (I I II 1 1 1 1 ( I h 1 1 1 1 1 II 1 1 M 1 1 1 1 

Db 1 PGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGEVPPGPLPEAVLALYN 60 

Qy 83 STRDRVAGESVEPEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSELRE 142 

Illlllllll :|MIIIIMIIIMIIIIIh I I I I I I llhll ||||::|| 
Db 61 STRDRVAGESADPEPEPEADYYAKEVTRVLMVDRNNAI YDKTKDI PHSVYMFFNTSDIRE 120 

Qy 143 AVPEPVLLSRAELRLLRLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWLSFDVTG 202 

lllll lllllllll I I lllllllhlllhlllll lllhhIMIIIIIIII 
Db 121 AVPEPPLLSRAELRLQRFKSNVEQHVELYEKYSNNSWRYLGNRLLSPTDSPEWLSFDVTS 18 0 

Qy 203 WRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATIHGMNRPFLLLMA 262 

llhll : : hill III I Mill lllllll : Mill Ml IMMMMI 
Db 181 WRKWLNQGDG I QGFR FSAHCS CDS KDN I LHVE I NG I S PKRRGDLGTI HDMNRPFLLLMA 240 

Qy 263 TPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLYI DFRKDLGWKWI HEP 127 

1 1 1 1 1 1 1 i i 1 1 M 1 1 M 1 1 1 M 1 1 M I MIMIIIIIIIIIIIIIIIIIIII 

Db 241 TPLERAQHLHSSRHRRALDTNYCFSST EKNCCVRQLY I DFRKDLGWKWI HEP 292 

Qy 323 KGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKP 382 

!' ! 1 1 1 1 1 1 1 1 1 I'll! ! 1 1 1 1 1 1 ! .' 1 1 1 ! ! I M , 1 1 I M 1 1 , 1 1 1 1 1 1 1 1 ! I 

Db 293 KGYHANFCLGPC P Y I WSLDTQYS KVLALYNQHN PGASAS PCCVPQALEPLP I VYYVGRKP 352 

Qy 3 83 KVEQLSNMIVRSCKCS 398 

III III lllllllll I 



Db 3 53 KVEQLSNMIVRSCKCS 368 



RESULT 3 
Q8JHF5 

ID Q8JHF5 PRELIMINARY; PRT; 379 AA . 

AC Q8JHF5; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel . 22, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha ; Percif ormes ; Percoidei; 

OC Sparidae; Sparus. 

OX NCBI_TaxID=8175; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Tafalla C. , Aranguren R . , Secombes C.J., Castrillo J.L., Novoa B., 

RA Figueras A. ; 

RT "Molecular characterization of sea bream (Sparus aurata) transforming 

RT growth factor betal."; 

RL Fish and Shellfish Immunol. 0:0-0(2002). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL ; AF510084; AAN03842.1; 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; IPR0018 39; TGFb . 

DR InterPro; I PRO 01 111; TGFb_N . 

DR InterPro; IPR003911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide; 2. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

SQ SEQUENCE 379 AA; 43506 MW; C0C9D3D2FCA29C0E CRC64 ; 

Query Match 40.1%; Score 848; DB 13; Length 379; 

Best Local Similarity 46.1%; Pred. No. 4.1e-67; 

Matches 185; Conservative 71; Mismatches 107; Indels 38; Gaps 15; 

Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAI RGQI LSKLRL -AS P PSQGDVPP 70 

h |:::: I h : h I I I I h h h I I : I I I I I I I lllillll I II 

Db 3 LVFLMFMVAYTVGK- VSGMSTCKTLDLEMVKKKRI EAI RSQI LSKLRLPTESPQAGD - - E 59 

Qy 71 GPLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKE VTRVLMVESGNQI YDKFK 12 5 

M -MM Ml- : = I = : I MM Ml = I = 
Db 60 EEI PSSLLSLYNSTKEMLKEQQTEVQTDI FTEQEEEEYFA TRVHKFNTTNPV 111 

Qy 126 GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV EQHVELYQKYSNDSWRYL 182 

|| :: | II Ihl - : Ih I II hi =1 II Mill I Ml 

Db 112 RTPQNMSMSFNISEIRRSIGDYRLLTTAELRML-IKAPTILDEQRVELYQGLGT-SPRYL 169 

Qy 183 SNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGF- -RLSAHSSSDSKDNTLHVEINGFN 240 

-I : MIIIMI := II = - I II I =| hi 



Db 170 ASRFITNELRDKWLSFDVTETLQNWLKGNDDVQVFQLRLYCDCGRSSDVSTFSFGISGMT 229 

Qy 241 SGRRGDLATIHGMNR- -PFLLLMATPLERAQHLHSSRHRRALDT-NYCFSSTDYKDDDDK 297 

:| Ml | : | : |::| |: | : || :|| :|: :| : | = |: 
Db 230 AG - RGDKAVLDDMTKQPPYI LTMS I PKNVSSHL- TSRKKRSTETKDTCTAQTE 280 

Qy 2 98 EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPG 357 

I I I I I I I I I I I I I I I I I I I ' I I II h hi I llh : :||::|||| I I I I 
Db 281 --TCCWSLYIDFRKDLGWKWIHKPTRYHANYCMGSCTYIWNAENKYSQI1ALYKHHNPG 338 

Qy 358 ASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 398 

III I I I II ' I I I II I h I I I I I: lllllllllhlllll 
Db 339 ASAQPCCVPQALEPLPILYYVGRQHKVEQLSNMIVKSCKCS 379 



RESULT 4 
Q8AXK8 



ID Q8AXK8 PRELIMINARY; PRT; 379 AA. 

AC Q8AXK8 ; 

DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1. 

OS Sparus aurata (Gilthead sea bream) . 

OC Eukaryota; Metazoa; Chorda ta; Craniata; Vertebra ta; Euteleostomi ; 

OC Actinopterygi i ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha ; Acanthopterygii ; Percomorpha ; Perciformes; Percoidei; 

OC Sparidae; Sparus. 

OX N CB I _Ta XlD=8175; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Tafalla C. , Novoa B., Aranguren R. , Figueras A.; 

RT "Molecular cloning and characterization of sea bream (Sparus aurata) 

RT TGF beta 1 . " ; 

RL Submitted (SEP-2001) to the EMBL/ GenBank / DDB J databases. 

DR EMBL; AF424703; AAN76665.1; 

SQ SEQUENCE 379 AA; 43485 MW; A3 06EC387F6DBA7C CRC64 ; 



Query Match 39.9%; Score 844; DB 13; Length 379; 

Best Local Similarity 46.1%; Pred . No. 9.2e-67; 

Matches 185; Conservative 70; Mismatches 108; Indels 38; Gaps 15; 



Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRL-ASPPSQGDVPP 70 

h I-- I h ^ h I I I I h h h I h I I I I I I I llllllll I II 

Db 3 LVFLMFMVAYTVGK- VSGMSTCKTLDLEMVKKKRI EAI RSQI LSKLRLPTESPQAGD - - E 59 

Qy 71 GPLPEAVLALYNSTRDRVAGESVEPEPE PEADYYAKEVTRVLMVESGNQI YDKFK 125 

:| ::|:|||lh: : = I = = I = h | ||! : ! = 
Ub 60 EEI PSSLLSLYNSTKEMLKEQQTEVQTDI FTEXEEEEYFA TRVHKFNTTNPV 111 

Qy 126 GTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV EQHVELYQKYSNDSWRYL 182 

II - I II I hi == = Ih lllh I =1 II INN MM 

Db 112 RTPQNMSMSFN I S EI RRS I GDYRLLTTAELRML - 1 KAPTI LDEQRVELYQGLGT - S PR YL 169 

Qy 183 SNRLLAPSDSPEWLSFDVTGVVRQWLTRREAIEGF- -RLSAHSSSDSKDNTLHVEINGFN 24 0 

-I : =1111111 - || : == | II I =| hi 

Db 170 ASRFITNELRDKWLSFDVTETLQNWLKGNDDVQVFQLRLYCDCGRSSDVSTFSFGISGMT 229 



Qy 241 SGRRGDLATIHGMNR- -PFLLLMATPLERAQHLHSSRHRRALDT-NYCFSSTDYKDDDDK 297 

: I I I I I ♦ I : | = = | hi : II :|| =|= =1 = I : h 
Db 230 AG-RGDKAVLDDMTKQPPYILTMSI PKNVSSHL-TSRKKRSTETKDTCTAQTE 280 

Qy 298 EKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVIALYNQHNPG 357 

I I I I llllllllllllllhl lllhhl I Mh = Mh I I I I I I I I 
Db 281 --TCCVRSLYIDFRKDLGWKWIHKPTRYHANYCMGSCTYIWNAENKYSQXLALYICHHNPG 338 

Qy 358 ASAAPCCVPQALEPLPIVTYVGRKPKVEQLSNMIVRSCKCS 398 

III llllllllllllhllllh lllllllllhlllll 
Db 33 9 ASAQPCCVPQALEPLP I LYYVGRQHKVEQLSNMI VKSCKCS 37 9 



RESULT 5 
Q91YU7 

ID Q91YU7 PRELIMINARY; PRT; 412 AA . 

AC Q91YU7; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor, beta 3. 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A. 

RA Strausberg R. ; 

RL Submitted (OCT-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC014690; AAH14690.1; 

DR MGD; MGI: 98727; Tgfb3 . 

DR Inter Pro; I PRO 024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; I PRO 03 911; TGF_TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR00438; GFCYSKNOT. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF__BETA_1 ; 1. 

SQ SEQUENCE 412 AA; 47144 MW; F3EB65D046DF32AD CRC64 ; 

Query Match 3 9.4%; Score 832; DB 11; Length 412; 

best Local Similarity 45.5%; Pred. No. 1.2e-65; 

Matches 190; Conservative 58; Mismatches 122; Indels 48; Gaps 14; 
Qy 15 LLWLLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVPPGPLP 74 

h I =1 : Mil hi * h I h II M I II II I I I I Ml : : I 

Db 9 LWLALLNLATI SLSLSTCTTLDFGH I KKKR VEA I RGQ I LS KLRLTS P PE PS VMT - - H VP 66 



Qy 



75 EAVLALYNSTR DRVAGESVE- - PEPEPEADYYAKEVTRVLMVE SGNQI YDKFKG 126 



Db 67 YQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFDMIQGLAEHNELAVCPKG 126 

Qy 127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVEQHVELYQKYSND S 178 

:: || I : : I III I • I I • : I I : I I : I I 

Db 127 ITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTEQRI ELFQI LRPDEHIAK 182 

Qy 179 WRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSKDN- 230 

11= : I : llllllll M - M III: I = II = I = I 

Db 183 QRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISIHCPCHTFQPNGDILENV 242 

Qy 231 - -TLHVEINGFNS GRRGDLATIHGM NRPFLLLMATPLER- -AQHLHSSRHRRAL 280 

: I | | | | : =11-11 I I • I I M I 

Db 243 HE VME I KFKG VDNEDDHGRGDLGRLKKQKDHHNPHL I LMM I P PHRLDS PGQGSQRKKRAL 302 

Qy 281 DTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSL 340 

MINI : hlllll MlllhlMllhlllllhllll IMIh I 

Db 3 03 DTNYCFRNL EENCCWPLYIDFRQDLGWKWVHEPKGYYANFCSGPCPYLRSA 354 

Qy 341 DTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMI VRSCKCS 3 98 

II =1 II III 1 1 MhllllM MM hlllll I M M I M h h I II 1 1 

Db 355 DTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPKVEQLSNMWKSCKCS 412 

RESULT 6 
Q9PWA9 

ID Q9PWA9 PRELIMINARY; PRT; 3 82 AA. 

AC Q9PWA9 ; 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel . 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA . 

OS Morone chrysops x Morone saxatilis (white bass x striped bass) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha / Acanthopterygii ; Percomorpha; Percif ormes ; Percoidei; 

OC Moronidae / Morone . 

OX NCBI_TaxID=45352; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=KIDNEY; 

RX MEDLINE=20394636; PubMed= 1 0 93 8723 ; 

RA Harms C.A. , Kennedy- Stoskopf S., Home W.A., Fuller F.J., 

RA Tompkins W.A.F.; 

RT "Cloning and sequencing hybrid striped bass (Morone saxatilis x M. 

RT chrysops) transforming growth factor-beta (TGF-beta) , and development 

RT of a reverse transcription quantitative competitive polymerase chain 

RT reaction (RT-qcPCR) assay to measure TGF-beta mRNA of teleost fish."; 

RL Fish Shellfish Immunol. 10:61-85(2000). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE. MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC -!- SUBUNIT: HOMODIMER, DI SULFIDE -LINKED (BY SIMILARITY) . 

CC -!- TISSUE SPECIFICITY: HIGHER LEVELS FOUND IN MONONUCLEAR CELLS FROM 

CC PERIPHERAL BLOOD THAN IN SPLEEN OR ANTERIOR KIDNEY. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY . 

DR EMBL; AF140363; AAD46997.1; -. 

DR HSSP; P01137; 1KLA. 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



InterPro; IPR0024 00; GF_cysknot . 

InterPro; IPR001839; TGFb . 

InterPro; IPR001111; TGFb_N . 

InterPro; IPR003 911; TGFJTGFb. 

Pfam; PF00019; TGF-beta; 1. 

Pfam; PF00688; TGFb_propeptide ; 1. 

PRINTS; PR00438; GFCYSKNOT. 

PRINTS; PRO 1423; TGFB ETA . 

ProDom; PD000357; TGFb; 1. 

SMART; SM00204 ; TGFB; 1. 

PROSITE; PS00250; TGF_BETA_1 ; 1. 

Growth factor; Mitogen; Glycoprotein; Signal. 



SIGNAL 


1 


? 


POTENTIAL. 


PROPEP 


? 


270 




CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


DISULFID 


278 


286 


BY SIMILARITY. 


DISULFID 


285 


348 


BY SIMILARITY. 


DISULFID 


314 


379 


BY SIMILARITY. 


DISULFID 


318 


381 


BY SIMILARITY. 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


CARBOHYD 


73 


73 


N-LINKED (GLCNAC . . .) (POTENTIAL) 


CARBOHYD 


108 


108 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


113 


113 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


124 


124 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


CARBOHYD 


259 


259 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SITE 


235 


237 


CELL ATTACHMENT SITE (POTENTIAL) . 


) SEQUENCE 


382 AA; 


43846 


MW; 3124D8C34EA74D72 CRC64 ; 


Query Match 




39.0% 


Score 825.5; DB 13; Length 382; 


Best Local Similarity 


45.9% 


Pred. No. 4.2e-65; 


Matches 185; 


Conservative 


66; Mismatches 107; Indels 45; 



15; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 



15 LLWLLVLTPGRPAAGLSTCKTI DMELVKRKR I EAI RGQI LSKLRL AS PPSQGD VPPG 71 

h -I I I :|:||llhhhlhlMIIII lllMIM I II 

6 LMLWVYTVGN- VSGMSTCKTLDLEMVKKKR I EAI RSQI LSKLRLPKEPEPDQAGDEEEI 64 

72 PLPEAVLALYNSTRDRVAGESVEPE PEPEADYYAKEVTRVLMVESGNQI YDKFKG 126 

I I :|:|||lh: = = I = : I : h I I = = I I 
65 PTP--LLSLYNSTKEMLKEQQTEVQTDISTEQEEEEYFAKVLHKFNMTRKNN 114 

127 TPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLKV- - EQHVELYQKYSNDSWRYLSN 184 

I : I II Ihlhl : lh lllhl I = II II I I I I Mi- 

ll 5 TDTTKKMFFNI SEI RESVGDYRLLTSAELRMLI KKTTI YDEQRVELYSGL-GDSPRYLAS 173 

185 RLLAPSDS PEWLSFDVTGWRQWLTRREAI EGF - - RLSAHSSSDSKDNTL - HVE I NGFNS 241 

I = :|llllll - II : HI II = I h hi = 

174 RFITNKWKDKWLSFDVTKTLQDWLKGTDDEQGFQLRLFCECNKVSAGETIFKFGISGIDP 233 

242 GRRGDLATIHGMNR- -PFLLLMATPLERAQHLHS SRHRRALDT-NYCFSSTDYKDDD 295 

I III : : : |::| h I |:: I || :|= : | : | : | : 
234 G-RGDTGPMQLLTQQPPYILTMSIP QN I SS PSTSRKKRSTETKDVCTAQTE 283 

296 DKEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKV1ALYNQHN 355 

UN llllllllllllllhl Ihlhhl I llh : :||::|||l II 
284 TCCVRSLYIDFRKDLGWKWIHKPTGYNANYCMGSCTYIWNAENKYSQILALYKHHN 339 

356 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 3 98 



Mill IIIIIIMIIIIMIIIh lllllllllhlllll 

Db 340 PGASAQPCCVPQALEPLP I LYYVGRQHKVEQLSNMI VKSCKCS 382 

RESULT 7 
093449 

ID 093449 PRELIMINARY; PRT; 382 AA . 

AC 093449; Q91217; 

DT 01 -NOV- 1998 (TrEMBLrel . 08, Created) 

DT 01-NOV-1998 (TrEMBLrel. 08, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

GN TGF-BETA OR TGF . 

OS Oncorhynchus mykiss (Rainbow trout) (Salmo gairdneri) . 

OC Eukaryota; Metazoa; Chordata; Craniata ; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; 

OC Protacanthopterygii; Salmonif ormes ; Salmonidae; Oncorhynchus. 

OX NCBI_TaxID=8 022 ; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC TI SSUE=LEUKOCYTE ; 

RX MEDLINE=99242020; PubMed=l 0227481 ; 

RA Daniels G.D., Secombes C. J. ; 

RT "Genomic organisation of rainbow trout, Oncorhynchus mykiss TGF- 

RT BETA."; 

RL Dev. Comp. Immunol. 23:139-147(1999). 

RN [2] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RC TISSUE=LEUKOCYTE; 

RX MEDLINE=98390168; PubMed=9722 928 ; 

RA Hardie L.J., Laing K.J., Daniels G.D. , Grabowski P.S., Cunningham C- , 

RA Secombes C.J. ; 

RT "Isolation of the first piscine transforming growth factor beta gene: 

RT analysis reveals tissue specific expression and a potential regulatory 

RT sequence in rainbow trout (Oncorhynchus mykiss)."; 

RL Cytokine 10:555-563(1998). 

CC -!- FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE . MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS . 

CC SUBUNIT: HOMODIMER, DISULFIDE -LINKED (BY SIMILARITY) . 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN BLOOD LEUKOCYTES, KIDNEY 

CC MACROPHAGES, BRAIN, GILL AND SPLEEN BUT NOT IN LIVER. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AJ007836; CAA07707.1; 

DR EMBL; X99303; CAA67685.1; -. 

DR HSSP; P01137; 1KLA , 

DR InterPro; IPR002400; GF__cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFbJtf. 

DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam,- PF00688; TGFb_propeptide; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein; Signal. 



FT 


S I GNAL 


1 


2 0 


POTENTIAL . 


FT 


PROPEP 


2 1 


270 




FT 


CHAIN 


271 


382 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


278 


286 


BY SIMILARITY. 


FT 


DISULFID 


285 


348 


BY SIMILARITY. 


FT 


DI SULFID 


314 


379 


BY SIMILARITY. 


FT 


DISULFID 


318 


381 


BY SIMILARITY. 


FT 


DISULFID 


347 


347 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


76 


76 


N -LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


X 1 D 


lib 


IN -J_j1I\1\Cj1J ^ OIjVwIN/iL- . . . ) \ r U 1 EjJN 1 X AL ) 


FT 


CARBOHYD 


125 


125 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CONFLICT 


237 


237 


N -> D (IN REF. 2) . 


FT 


CONFLICT 


345 


345 


Q -> H (IN REF. 2) . 


FT 


CONFLICT 


371 


372 


LS -> VP (IN REF. 2) . 


FT 


CONFLICT 


377 


377 


K -> M (IN REF. 2) . 


SQ 


SEQUENCE 


382 AA; 


44136 MW; 


93BD4D3540084B92 CRC64 ; 


Query Match 




39 . 0%; 


Score 823.5; DB 13; Length 382; 



Best Local Similarity 46.9%; Pred . No. 6.3e-65; 

Matches 180; Conservative 58; Mismatches 107; Indels 39; Gaps 12; 

Qy 30 LSTCKTI DMELVKRKRI EAI RGQI LSKLRLAS PP SQGDVPPGPLPEAVLALYNSTRD 86 

:||lh:|:||||lllllllllllllllll I * I I =1 = = --1111 = 

Db 23 MSTCKSLDLELVKRKR I EAI RGQI LSKLRLPKEPE I DQEGDTE - - EVPASLMS I YNSTVE 80 

Qy 87 RVAGESVE PEPEPEADYYAKEVTRVLMVESGNQI YDKFKGTPHSLYMLFNTSEL 14 0 

= = I I : I II hi I II = 1=11 = I : III Ih 

Db 81 -LSEEQVHTYI PSTQDAEEEA- YFAKEVHKFNMKQSENT SKHQI - -LFNMSEM 12 9 

Qy 141 REAVPEPVLLSRAELRLL RLKLKVEQHVELYQKYSNDSWRYLSNRLLAPSDSPEWL 196 

I = llhllllll I II =11 I: | Ml : :: : |: 

Db 130 RSVLGTDRLLSQAELRLLIKNHGLLDDSEQRLELYRGV-GDKARYLKSHFVSKEWANRWV 188 

Qy 197 SFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEINGFNSGRRGDLATI--HGMN 254 

Mill = =11 =11 = 1 =1 = 1 I Ih 1 = 

Db 189 SFDVTQTLNEWLQGAGEEQGFQLKLPCDCGKPMEEFRFKI SGMNK- LRGNTETLAMKMPS 247 

Qy 2 55 RPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDL 314 

= 1 =111= hll I I = h I II = = IMhlllllllll 

Db 248 KPHILLMSLPVERHSQLSSRKKRQTTTEEIC SDKSESCCVRKLYI DFRKDL 298 

Qy 315 GWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI 3 74 

llllllll II IhhIM llh = =11 = 11111 MINN MINI MINI 

Db 2 99 GWKWIHEPTGYFANYCIGPCTYIWNTENKYSQVLALYKHHNPGASAQPCCVPQVLEPLPI 358 

Qy 375 VYYVGRKPKVEQLSNMI VRSCKCS 398 

= I I I I h I I I I I I I I I h I h I I 
Db 359 I Y YVGRQHKVEQLSNM I VKS CRCS 382 



RESULT 8 
Q91VP5 

ID Q91VP5 PRELIMINARY; PRT; 414 AA . 

AC Q91VP5; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 



DE Similar to transforming growth factor, beta 2. 

GN TGFB2 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 

RC TISSUE=Breast tumor; 

RA Strausberg R. ; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY . 

DR EMBL; BC011170; AAH11170.1; -. 

DR MGD; MGI: 98726; Tgfb2 . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; IPR003911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT . 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204 ; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1; 1. 

SQ SEQUENCE 414 AA; 4 7588 MW; DB37A7C38881F286 CRC64 ; 

Query Match 38.7%; Score 818.5; DB 11; Length 414; 

Best Local Similarity 44.4%; Pred. No. 2e-64; 

Matches 189; Conservative 61; Mismatches 121; Indels 55; Gaps 16; 

Qy 12 LLPLLWLLVLTPGRPAAGLSTCKTI DMELVKRKRI EAIRGQI LSKLRLASPPSQGDVP - P 70 

:| I I II I I I I I hlh llllllllllllllhl III III 

Db 5 VLSTFLLLHLVP- -VALSLSTCSTLDMDQFMRKRI EAIRGQI LSKLKLTS PPE- -DYPEP 60 

Qy 71 GPLPEAVLALYNSTRD RVAGESVEPEPE- PEADYYAKEVTRVLM VESGNQI YD 122 

= 1 h-llllll : : : I I : HMIM == I : I I I 

Db 61 DEVPPEVISIYNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKIDMPSHLPSENAIPP 120 

Qy 123 KFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKV- EQHVELYQ K 173 

I h : | : | : : I HI h II I -'I II HIM 

Db 121 TF Y - RPYFRI VRFDVSTMEKNASN LVKAEFRVFRLQNPKARVAEQRI ELYQI LKSKD 176 

Qy 174 YSNDSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSS 225 

:: : ||= : = = : MINIM MM I := I MM I 

Db 177 LTSPTQRY I DSKWKTRAEGEWLSFDVTDAVQEWLHHKDRNLGFKI SLHCPCCTFVPSNN 236 

Qy 226 DSKDNTLHVE INGFNSGRRGDLATI HGMNR PFLLLMATPLERAQHLHS 273 

• i I 1 = 1 == MM I MM I I : I 

Db 237 YIIPNKSEELEARFAGIDGTSTYASGDQKTIKSTRKKTSGKTPHLLLMLLPSYRLESQQS 296 

Qy 274 SRH-RRALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLG 332 

M Mill III : : llhl MMhMMMMIMIIhMM I 

Db 297 SRRKKRALDAAYCFRNV QDNCCLRPLYIDFKRDLGWKWIHEPKGYNANFCAG 348 

Qy 333 PCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIV 3 92 

I I I : I I MhMlhlll II I I I : I I I I I MM hlhl IhlllMIII 



Db 349 ACPYLWSSDTQHTKVLSLYNTINPEASASPCCVSQDLEPLTILYYIGNTPKIEQLSNMIV 408 



Qy 3 93 RSCKCS 3 98 

HUM 

Db 4 09 KSCKCS 414 



RESULT 9 
Q9PTQ2 

ID Q9PTQ2 PRELIMINARY; PRT; 376 AA . 

AC Q9PTQ2 / 

DT 01-MAY-2000 (TrEMBLrel . 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta precursor. 

OS Cyprinus carpio (Common carp) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii ; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae; Cyprinus. 

OX NCBI_TaxID=7962; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Yin Z . , Kuang J. ; 

RT "Molecular cloning of carp transforming growth factor beta 1 . " ; 

RL Submitted (MAR-1999) to the EMBL/ GenBank/DDB J databases. 

CC FUNCTION: IS LIKELY TO BE AN IMPORTANT CYTOKINE REGULATING IMMUNE 

CC RESPONSE . MAY ALSO HAVE A ROLE IN OTHER PHYSIOLOGICAL SYSTEMS. 

CC SUBUNIT: HOMODIMER, D I SULF I DE - LI NKED (BY SIMILARITY) . 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF136947; AAF22573.1; 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR001839; TGFb . 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 



DR 


PRINTS; 


PR01423; 


TGFB ETA. 




DR 


ProDom; 


PD000357; 


TGFb; 1. 




DR 


SMART; SM00204; TGFB; 1. 




DR 


PROSITE; 


PS00250; 


TGF BETA 1; 


1 . 


KW 


Growth factor; Mitogen; Glycoprotein; Signal. 


FT 


SIGNAL 


1 


22 


POTENTIAL. 


FT 


PROPEP 


23 


264 




FT 


CHAIN 


265 


376 


TRANSFORMING GROWTH FACTOR BETA. 


FT 


DISULFID 


272 


280 


BY SIMILARITY. 


FT 


DISULFID 


308 


373 


BY SIMILARITY. 


FT 


DISULFID 


312 


375 


BY SIMILARITY. 


FT 


DISULFID 


341 


341 


INTERCHAIN (BY SIMILARITY) . 


FT 


CARBOHYD 


76 


76 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


125 


125 


N -LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


167 


167 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


SITE 


230 


232 


CELL ATTACHMENT SITE (POTENTIAL) . 


SQ 


SEQUENCE 


376 AA 


; 43329 MW; 


7F7FC4DA58B69681 CRC64 ; 



Query Match 37.3%; Score 788.5; DB 13; Length 376; 

Best Local Similarity 44.5%; Pred. No. 8.1e-62; 

Matches 179; Conservative 61; Mismatches 125; Indels 37; Gaps 14; 



Qy 6 LRLLPLLLPLLWLLVXTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQ 65 

:h III I II : I I I I : h I h I I I I I I I I I I I I I I I I I h I 

Db 1 MRVESLLLALQCLLGFV- -HYSGALSTCSPLDLELIKRKRIEAIRGQILSKLRLSKEPEV 58 

Qy 66 GDVPPGP - LPEAVLALYNST RDRVAGES VEPEPEP - EADYYAKEVTR - - VLMVESGN 118 

:| ::::|||| = | :|: :| | :|||||| : = ::| 

Db 5 9 DEEKESQNI PAELI SVYNSTVELNEEQAAPPEQPKEDPVEEEYYAKEVHKFTI KLMEKNP 118 

Qy 119 QI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRLKLK- -VEQHVELYQKYSN 176 

III : II : = = : : -h Mill II HIM I 

Db 119 DKF LWFN I TD I SQTLGLNR 1 1 SQ VELRLL I TTF PDGS EQRLELYQVI GN 167 

Qy 177 DSWRYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAHSSSDSKDNTLHVEI 236 

I Ml :| = = MIIIMI ■ ' II I II M M I : I = I -I 

Db 168 KS -RYLESRFI - - PNQRKWLSFDVTQTLKDWLQRSEAEQGFQLKMADNCDPQ- KTFQLKI 223 

Qy 23 7 NGFNSGRRGDLATIH-GMNRPFLLLMATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDD 295 

I 111 h I II :| = |= lh I I 1= I 
Db 224 PGLVL-VRGDTETLAVNMPRPHILVMSLPLDGNNSSKSRRKRQTETDQVC T 273 

Qy 296 DKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHN 355 

II MM I : ' I I I I I I I I I ' I I I IMIMI I I =:|= : : I I : I I I I I II 
Db 274 DKSDGCCVRSLYIDFRKDLGWKWIHEPSGYYANYCTGSCSFVWTSENKYSQVLALYKHHN 333 

Qy 356 PGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKC 3 97 

HIM II III I MM Mill: I i 1 I I i I I I ^ I I 
Db 3 34 PGASAQPCRVPQVLNPLPIFYYVGRQHKVEQLSNMI VKTCKC 375 

RESULT 10 
Q9ERB7 

ID Q9ERB7 PRELIMINARY ; PRT; 399 AA. 

AC Q9ERB7; 

DT 01-MAR-2001 (TrEMBLrel . 16, Created) 

DT 01-MAR-2001 (TrEMBLrel . 16, Last sequence update) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last annotation update) 

DE Transforming growth factor-beta 2 (Fragment) . 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus. 

OX NCBI_TaxID=10036; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Ramesh G. , Kondaiah P. , Seshagiri P.B.; 

RT "Differential expression and selective localization of transforming 

RT growth factor-beta isoforms in the hamster uterus during estrous 

RT cycle . " ; 

RL Submitted (AUG-2000) to the EMBL/ GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AY007214; AAG02247.1; -. 

DR HSSP; P08112; 2TGI . 

DR InterPro; IPR00183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; I PRO 03 9 11; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 



DR Pfam; PF00688; TGFbjpropept ide ; 1. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

FT NON_TER 1 1 

FT NONJTER 3 99 3 99 

SQ SEQUENCE 399 AA; 46078 MW; A6FF8E65EAFD514 8 CRC64 ; 

Query Match 35.8%; Score 757.5; DB 11; Length 399; 

Best Local Similarity 43.2%; Pred. No. 5.2e-59; 

Matches 178; Conservative 58; Mismatches 121; Indels 55; Gaps 16; 

Qy 18 LLVLTPGRPAAGLSTCKTIDMELVKRKRIEAIRGQILSKLRLASPPSQGDVP-PGPLPEA 76 

II I I I I I I I I : I I : llllllllllllllhl Ml IN 

Db 4 LLHLVP - - VALSLSTCSTLDMDQFMRKR I EAI RGQILSKLKLTSPPE - -DYPEPDEVPPE 59 

Qy 77 VLALYNSTRD RVAGESVEPEPE- PEADYYAKEVTRVLM VESGNQI YDKFKGTP 128 

h-MIMI = = = II = =111111 := I Mill 

Db 60 VI S I YNSTRDLLQEKASRRAAACERERSDEEYYAKEVYKI DMPSHFPSENAI PPTFY -RP 118 

Qy 129 HSLYMLFNTSELREAVPEP VLLSRAELRLLRL KLKV- EQHVELYQ KYSNDSW 179 

: : | : | : : I Ml h II I =| II Mill :: = 

Db 119 YFRI VRFDVSMMEKNASN LVKAEFRVFRLQNPKARVAEQRI ELYQILKSKDLTSPTQ 175 

Qy 180 RYLSNRLLAPSDSPEWLSFDVTGWRQWLTRREAIEGFRLSAH SSSDSK 228 

Ih :::: MINIM I Ml =: I M M I : M 

Db 176 RYIDSKWKTRAEGEWLSFDVTDAVHEWLHHKDRNLGFKISLHCPCCTFVPFNNNIIPNK 235 

Qy 229 DNTLHVE 1 NGFNSGRRGDLAT I HGMNR PFLLLMATPLERAQHLHSSRH-RR 278 

I hi : I II I MM I h hi M 

Db 236 SEELEARFAGI DGTSQHSSGHQETI KSTRKKNSGKTPHLLLMLLPSYRLESQQSNRRKKR 2 95 

Qy 279 ALDTNYCFSSTDYP<DDDDKEKNCCWQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 338 

III Ml : : llhl 1 1 1 MM Ml Ml III II IMMM I HIM 

Db 2 96 ALDAAYCFRNV QDNCCLRPLYI DFKRDLGWKWI HEPKGYNANFCAGACPYLW 34 7 

Qy 33 9 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNM 3 90 

I MhMMMII II llhllll MM hlhl Ihllllll 
Db 348 SSDTQHTKVLSLYNTI NPEASAS PCCVSHDLEPLTI LYYI GNTPKI EQLSNM 3 99 



RESULT 11 
Q99K17 

ID Q99K17 PRELIMINARY; PRT; 362 AA. 

AC Q99K17; 

DT 01-JUN-2001 (TrEMBLrel . 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT Ul-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Similar to transforming growth factor, beta 3 (Fragment) . 

GN TGFB3 . 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent la ; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N . A . 



RA Strausberg R.; 

RL Submitted (MAR-2001) to the EMBL/ GenBank/ DDB J databases. 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; BC005513; AAH05513.1; 

DR HSSP; P10600; 1TGJ. 

DR MGD; MGI : 98 727; Tgf b3 . 

DR InterPro; I PRO 024 00; GF_cysknot . 

DR InterPro; IPR001839; TGFb. 

DR InterPro; IPR001111; TGFb_N . 

DR InterPro; IPR003911; TGF_TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR0043 8; GFCYSKNOT. 

DR PRINTS; PR01423; TGFB ETA . 

DR ProDom; PD0003 57; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS 00250; TGF_BETA_1 ; 1. 

FT NONJTER 1 1 

SQ SEQUENCE 362 AA; 41486 MW; 08 08E4 61 8 0FDAE7 0 CRC64 ; 

Query Match 34.2%; Score 724; DB 11; Length 362; 

Best Local Similarity 44.3%; Pred . No. 4.4e-56; 

Matches 166; Conservative 52; Mismatches 109; Indels 48; Gaps 14; 
Qy 58 RLASPPSQGDVPPGPLPEAVLALYNSTR DRVAGESVE - - PEPEPEADYYAKEVTRVL 112 

h Ml • M 1 1 llllll I ••II I : 1-1 1 MM- = 

Db 2 RVGSPPEPSVMT- -HVPYQVLALYNSTRELLEEMHGEREEGCTQETSESEYYAKEIHKFD 59 

Qy 113 MVE SGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL KLKVE 165 

I- I- II :: || | : : | ||| |:||: : | 

Db 60 MIQGLAEHNELAVCPKGITSKVFR-FNVSSVEK NGTNLFRAEFRVLRVPNPSSKRTE 115 

Qy 166 QHVELYQKYSND SWRYLSNRLLAPSDS PEWLSFDVTG WRQWLTRREAI EGFRLSA 221 

I : I h I I II* • I : I | | r | | |||: | 

Db 116 QRIELFQILRPDEHIAKQRYIGGKNLPTRGTAEWLSFDVTDTVREWLLRRESNLGLEISI 175 

Qy 222 H SSSDS KDN TLH VE I NGFNS GRRGDLAT I HGM NRPFLLLMATPL 265 

I = I M : | :: MM : I I - ! I I 

Db 176 HCPCHTFQPNGDILENVHEVMEIKFKGVDNEDDHGRGDLGRLKKQKDHHNPHLILMMIPP 235 

Qy 266 ER--AQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCC*WQLYIDFRKDLGWKWIHEPK 323 

I = II MINIUM : IMMM MlllhllllMMIM 

Db 236 HRLDS PGQGSQRKKRALDTNYCFRNL EENCCVRPLYIDFRQDLGWKWVHEPK 287 

Qy 324 GYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPK 383 

IMMM 1 1 1 1 M I M M II Ml 1 1 1 1 1 = 1 1 f 1 1 1 MM IMMM II 

Db 2 88 GYYANFCSGPCPYLRSADTTHSTVLGLYNTLNPEASASPCCVPQDLEPLTILYYVGRTPK 34 7 

Qy 384 VEyLb'NMIVRSCKCS 3 98 

MIMMMMMM 

Db 34 8 VEQLSNMWKSCKCS 362 



RESULT 12 
Q08714 

ID Q08714 PRELIMINARY; PRT; 130 AA . 

AC Q08714; 070331; 



DT 01-NOV-1996 (TrEMBLrel . 01, Created) 

DT 01-NOV-1996 (TrEMBLrel . 01, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1. 

OS Mesocricetus auratus (Golden hamster) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Cricetinae; 

OC Mesocricetus . 

OX NCBI_TaxID=10036; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=LVG (SYR) ; 

RX MEDLINE=93304479; PubMed=83 17544 ; 

RA Wong D.T., Donoff R.B., Yang J. , Song B.Z., Matossian K. , Nagura N., 

RA Elovic A . , McBride J., Gallagher G. , Todd R. ; 

RT "Sequential expression of transforming growth factors alpha and beta 1 

RT by eosinophils during cutaneous wound healing in the hamster."; 

RL Am. J. Pathol. 143:13 0-142(1993). 

RN [2] 

RP SEQUENCE OF 26-115 FROM N.A. 

RC STRAIN=SYRIAN; TISSUE=SPLEEN; 

RX MEDLINE=98234044 ; PubMed=9573 100 ; 

RA Melby P.C., Tryon V.V. , Chandrasekar B., Freeman G.L.; 

RT "Cloning of Syrian hamster (Mesocricetus auratus) cytokine cDNAs and 

RT analysis of cytokine mRNA expression in experimental visceral 

RT leishmaniasis . " ; 

RL Infect. Immun. 66:2135-2142(1998). 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 
CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 

CC -!- SUBUNIT: HOMODIMER, DISULFIDE-LINKED . 

CC -!- SIMILARITY: TO OTHER GROWTH FACTORS OF THE TGF-BETA FAMILY. 

DR EMBL; X60296; CAA42838.1; 

DR EMBL; AF046214; AAC40099.1; -. 

DR HSSP; P01137; 1KLA. 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS0025 0; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


NON_TER 


1 


1 




FT 


PROPEP 


<1 


18 




FT 


CHAIN 


19 


130 


TRANSFORMING GROWTH FACTOR 


FT 


UlbULFID 


25 


34 


BY SIMILARITY. 


FT 


DISULFID 


33 


96 


BY SIMILARITY. 


FT 


DISULFID 


66 


129 


BY SIMILARITY. 


FT 


DISULFID 


95 


95 


INTERCHAIN (BY SIMILARITY) 


FT 


CONFLICT 


93 


93 


G -> S (IN REF. 2) . 


SQ 


SEQUENCE 


130 AA; 


14997 


MW; 8B41DD6CF39CCA77 CRC64 ; 



Query Match 32.9%; Score 696; DB 11; Length 130; 

Best Local Similarity 92.0%; Pred. No. 3.1e-54; 



Matches 127; Conservative 0; Mismatches 3; Indels 8; Gaps 1; 



Qy 261 MATPLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIH 320 

M II I II II 1 1 1 1 M I M M I II II I II lllllllllllllllllllllll 

Db 1 MATPLERAQHLQSSRHRRALDTNYCFSST EKNCCVRQLYIDFRKDLGWKWIH 52 

Qy 321 EPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI VYYVGR 38 0 

1 1 M 1 1 1 M: I III .' I U Ml 1 1 II M 1 1 1 1 1 1 1 1 1 M , lllllllllllllllllll 

Db 53 EPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAGPCCVPQALEPLPI VYYVGR 112 

Qy 381 KPKVEQLSNMIVRSCKCS 398 

1 1 [ 1 1 1 1 1 1 1 1 r 1 1 in 

Db 113 KPKVEQLSNM I VRS YKCS 13 0 



RESULT 13 
Q95N80 



ID Q95N80 PRELIMINARY; PRT; 124 AA . 

AC Q95N80; 

DT 01-DEC-2001 (TrEMBLrel . 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 1 (Fragment) . 

OS Canis familiaris (Dog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Carnivora; Fissipedia; Canidae; Canis. 

OX NCBI_TaxID=9615; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Fonfara S., Groene A., Baumgaertner W.; 

RT "Sequence of canine transforming growth factor beta 1 mRNA in DH82- 

RT cells."; 

RL Submitted (FEB-2001) to the EMBL/ GenBank/DDBJ databases. 

CC SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF349538; AAK54072.1; 

DR InterPro; IPR001839; TGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR ProDom; PD0 00357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS002 50; TGF_BETA_1 ; 1. 

FT NONJTER 1 1 

FT NON__TER 124 124 

SQ SEQUENCE 124 AA; 14329 MW; 2 1D18 52 18E5556DB CRC64 ; 



Query Match 32.1%; Score 67 9; DB 6; Length 124; 

Best Local Similarity 93.2%; Pred. No. 9.6e-53; 

Matches 123; Conservative 0; Mismatches 1; Indels 8; Gaps 1; 

Uy 264 PLERAQHLHSSRHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPK 323 

Illlllllllll lllllllllllll II I'll I I III II I II I I Ml I II III 

Db 1 PLERAQHLHSSRQRRALDTNYCFSST EKNCCVRQLY I DFRKDLGWKWI HEPK 52 

Qy 324 GYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPK 383 

1 1 1 II M I i:i III M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II Ml 1 1 1 1 1 1 li 1 1 1 III 

Db 53 GYHANFCLGPCPYI WSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLP I VYYVGRKPK 112 

Qy 3 84 VEQLSNMIVRSC 3 95 



Illlllllllll 

Db 113 VEQLSNMIVRSC 124 

RESULT 14 
Q98854 

ID Q98854 PRELIMINARY; PRT; 361 AA. 

AC Q98854; 

DT 01-FEB-1997 (TrEMBLrel . 02, Created) 

DT 01-FEB-1997 (TrEMBLrel . 02, Last sequence update) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Transforming growth factor beta 2 (TGF-beta 2) (Fragment) . 

GN TGFB2 . 

OS Cyprinus carpio (Common carp) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi ; Cyprinif ormes ; 

OC Cyprinidae; Cyprinus. 

OX NCBI_TaxID=7962; 

RN [ 1 ] 

RP SEQUENCE FROM N . A . 

RC TISSUE=HEART; 

RX MEDLINE=97354301; PubMed=9210595 ; 

RA Sumathy K. , Desai K.V., Kondaiah P.; 

RT "Isolation of transforming growth factor-beta2 cDNA from a fish, 

RT Cyprinus carpio by RT-PCR."; 

RL Gene 191:103-107(1997). 

CC -!- FUNCTION: TGF-BETA 2 HAS SUPPRESSIVE EFFECTS ON INTERLEUKIN-2 
CC DEPENDENT T-CELL GROWTH. 

CC -!- SUBUNIT: HOMODIMER, D I SULF I DE - LI NKED (BY SIMILARITY). 

CC -I- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL ; U66874; AAB62983.1; 

DR HSSP; P08112; 2TGI . 

DR Inter Pro; I PRO 0183 9; TGFb. 

DR InterPro; IPR001111; TGFb_N. 

DR InterPro; I PRO 03 9 11; TGFJTGFb. 

DR Pfam; PF00019; TGF-beta; 1. 

DR Pfam; PF00688; TGFb_propept ide ; 1. 

DR PRINTS; PR01423; TGFBETA . 

DR ProDom; PD0 00357; TGFb; 1. 

DR SMART; SM002 04; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT 


NONJTER 


1 


1 




FT 


PROPEP 


<1 


257 




FT 


CHAIN 


258 


361 


TRANSFORMING GROWTH FACTOR BETA 2. 


FT 


DISULFID 


264 


273 


BY SIMILARITY. 


FT 


DISULFID 


272 


335 


BY SIMILARITY. 


FT 


DISULFID 


334 


334 


INTERCHAIN (BY SIMILARITY) . 


b'T 


CARBOHYD 


30 


30 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


98 


98 


N-LINKED (GLCNAC. . . ) (POTENTIAL) 


FT 


CARBOHYD 


199 


199 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


NONJTER 


361 


361 




SQ 


SEQUENCE 


361 AA; 


41931 


MW; 94D930FA970A3FD3 CRC64 ; 



Query Match 32.0%; Score 677; DB 13; Length 361; 

Best Local Similarity 41.2%; Pred . No. 6.7e-52; 

Matches 155; Conservative 54; Mismatches 115; Indels 52; Gaps 



Qy 52 QILSKLRLASPPSQGDVPPGP- -LPEAVLALYNSTRDRVAGESVEPEPEPE ADYY 104 

III ||:h II =: I I = -IMIMM = == I I : I I 

Db 1 QI LCKLKLSCPP EI YPEPEEVSRDI I AI YNSTRDLLQEKANERAATCERQRTGEEYY 57 

Qy 105 AKEVTRVLM VESGNQI YDKFKGTPHSLYMLFNTSELREAVPEPVLLSRAELRLLRL- 160 

MM I : I I I h = h I : I MM h II 

Db 58 AKEVHKI DMQPFYPAEN VI PTKHY - NPYFRRLRFDVSSMEKNASN LVKAELRI FRLQ 113 

Qy 161 --KLKV-EQHVELYQ KYSNDSWRYLSNRLIAPSDSPEWLSFDVTGVVRQWLTRRE 212 

I : I II Mill =: = Ih IMIMM I Ml h 

Db 114 N PKARVS EQR I ELYQ I LGHKDLTS PTQR Y I DSKWRTRTEGEWLS FDVTEAVSEWLLHRD 173 

Qy 213 A I EGFRLSAHSSS DSKDNTLHVEINGFNSG- -RRGDLATI HGMNR 255 

I I : : I I M I I : III I 

Db 174 RNNGFKISLHCPCCTFVPSNNYI I PNKSEELEARFAGIDDSFVHGGDLKMFKKRRHSGQS 233 

Qy 256 PFLLLMATPLERAQHLHS S - RHRRALDTNYCFSSTDYKDDDDKEKNCCVRQLY I DFRKDL 314 

Mill I I : I I I Mill Ml : = I MM I IMIMM 

Db 234 PHLLLMLLPSYRLESQHKSHRQKRALDAAFCFRNV QDNCCLRSLYIDFKKDL 285 

Qy 315 GWKWIHEPKGYHANFCLGPCPYIWSLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPI 3 74 

1 1 1 1 M 1 1 M I M 1 1 1 I M I M I M I M M Ml II I MM Ml I MM I 

Db 286 GWKWIHEPKGYNANFCAGACPYLWSADTQHSNILGLYNTINPEASASPCCVSQDLEPLTI 345 

Qy 3 75 VYYVGRKPKVEQLSNM 3 90 

Mhh IIMMIM 

Db 346 LY Y I GKT P KI EQLSNM 361 

RESULT 15 
002730 

ID 00273 0 PRELIMINARY; PRT; 112 AA. 

AC 002730; 097501; 

DT 01-JUL-1997 (TrEMBLrel . 04, Created) 

DT 01-JUL-1997 (TrEMBLrel. 04, Last sequence update) 

DT 01-JUN-2002 (TrEMBLrel . 21, Last annotation update) 

DE Transforming growth factor beta 1 (TGF-beta 1) (Fragment) . 

GN TGFB1 OR TGF-BETA-1. 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chorda ta; Crania ta; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N . A . 

RA Taylor T.K., James E.R., McGonigle S., Yoho E.R.; 

RL Submitted (APR-1997) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE OF 2-99 FROM N.A. 

RA Inoue K. , Kawabe Y . , Kodama T. ; 

RL Submitted (NOV-1998) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: TGF-BETA 1 IS A MULTIFUNCTIONAL PEPTIDE THAT CONTROLS 

CC PROLIFERATION, DIFFERENTIATION, AND OTHER FUNCTIONS IN MANY CELL 

CC TYPES. MANY CELLS SYNTHESIZE TGF-BETA 1 AND ESSENTIALLY ALL OF 

CC THEM HAVE SPECIFIC RECEPTORS FOR THIS PEPTIDE. TGF-BETA 1 

CC REGULATES THE ACTIONS OF MANY OTHER PEPTIDE GROWTH FACTORS AND 

CC DETERMINES A POSITIVE OR NEGATIVE DIRECTION OF THEIR EFFECTS. 



CC -!- SUBUNIT: HOMODIMER, DI SULF I DE- LINKED (BY SIMILARITY). 

CC -!- SIMILARITY: BELONGS TO THE TGF-BETA FAMILY. 

DR EMBL; AF000133; AAB53806.1; 

DR EMBL; AB020217; BAA36950 . 1 ; -. 

DR HSSP; P01137; 1KLA . 

DR InterPro; IPR002400; GF_cysknot . 

DR InterPro ; IPR001839; TGFb . 

DR Pfam; PF00019; TGF-beta; 1. 

DR PRINTS; PR00438; GFCYSKNOT . 

DR ProDom; PD000357; TGFb; 1. 

DR SMART; SM00204; TGFB; 1. 

DR PROSITE; PS00250; TGF_BETA_1 ; 1. 

KW Growth factor; Mitogen; Glycoprotein. 



FT NON TER 


1 


1 






FT CHAIN 


1 


112 




TRANSFORMING GROWTH FACTOR BETA 1 


FT DISULFID 


7 


16 




BY SIMILARITY. 


FT DISULFID 


15 


78 




BY SIMILARITY. 


FT DISULFID 


44 


109 




BY SIMILARITY. 


FT DISULFID 


48 


111 




BY SIMILARITY. 


FT DISULFID 


77 


77 




INTERCHAIN (BY SIMILARITY) . 


FT CONFLICT 


2 


3 




LD -> FS (IN REF. 2) . 


FT CONFLICT 


85 


92 




PLPIVYYV -> ATAHRVTTL (IN REF. 2) 


SQ SEQUENCE 


112 AA; 


12795 MW 


; 53C5B7D46355A6F3 CRC64 ; 


Query Match 




29. 


.5%; 


Score 624; DB 6; Length 112; 


Best Local Similarity 


93 


.3%; 


Pred. No. 6.6e-48; 



Matches 112; Conservative 0; Mismatches 0; Indels 8; Gaps 1; 

Qy 279 ALDTNYCFSSTDYKDDDDKEKNCCVRQLYIDFRKDLGWKWIHEPKGYHANFCLGPCPYIW 338 

IIMIIIIIII I I I li Ih I I I I I ' I I II il I I I I I I ,11 I I I I I I I I I I II 

Db 1 ALDTNYCFSST EKNCCVRQLYI DFRKDLGWKWIHEPKGYHANFCLGPCPYIW 52 

Qy 339 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 398 

I M I M 1 1 M I Mill: I III 1 , 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 , 1 MM 1 1 1 1 1 1 III I 

Db 53 SLDTQYSKVLALYNQHNPGASAAPCCVPQALEPLPIVYYVGRKPKVEQLSNMIVRSCKCS 112 



Search completed: October 28, 2003, 09:12:25 
Job time : 32.909 sees 



